Chemoreceptors in plant parasitic nematodes

ABSTRACT

Isolated DNA encoding a nematode guanylyl cyclase chemoreceptor is disclosed. Preferably, the encoded nematode guanylyl cyclase chemoreceptor is selected from the group consisting of order Tylenchida and order Aphelenchida chemoreceptors. Also disclosed are vectors and cells containing the DNA, the encoded proteins, oligonucleotides that bind thereto, and methods of using the same.

FIELD OF THE INVENTION

[0001] The present invention concerns isolated DNA encodingchemoreceptors of plant parasitic nematodes, cells that express suchDNA, the proteins so expressed, and methods of use thereof.

BACKGROUND OF THE INVENTION

[0002] Annual crop losses to plant-parasitic nematodes (soil-dwellingmicroscopic worms) are estimated to exceed 70 billion dollarsworld-wide. The soybean cyst nematode (SCN), Heterodera glycines, causesabout one billion in annual soybean losses in the United States alone.Environmental restrictions in the use of toxic nematicides andlimitations in available plant resistance schemes to nematodes haveprompted an urgent need for alternative management strategies to reducenematode-related damage in agriculture.

[0003] One way to control nematodes is by understanding and specificallyinterfering with the nematode's ability to locate and feed from plantroots. Like most plant-parasitic nematodes, infective juveniles of SCNmigrate in the soil and use their neurosensory organs to follow chemicalsignals emanating from host roots that they will attack

[0004] Chemoreceptor molecules have been identified in the modelnematode, Caenorhabditis elegans, as described in S. Yu et al., Proc.Natl. Acad. Sci. USA 94, 3384-3387 (1997). However, essentially nothingis known about putative chemoreceptors in plant-parasitic nematodes.Accordingly, there is a continued need for more information about thechemoreceptors of plant-parasitic nematodes.

SUMMARY OF THE INVENTION

[0005] A first aspect of the present invention is an isolated DNAencoding a nematode guanylyl cyclase chemoreceptor selected from thegroup consisting of: (a) isolated DNA having a nucleotide sequenceselected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ IDNO: 4, and SEQ ID NO: 6; (b) isolated DNA that hybridizes to DNA of (a)above under stringent conditions and encodes a nematode guanylyl cyclasechemoreceptor; and (c) isolated DNA that differs from the DNA of (a) or(b) above due to the degeneracy of the genetic code, and encodes anematode guanylyl cyclase chemoreceptor encoded by (a) or (b) above.

[0006] In one particular embodiment of the invention, the isolated DNAis selected from the group consisting of: (a) isolated DNA having anucleotide sequence selected from the group consisting of SEQ ID NO: 4and SEQ ID NO: 6; (b) isolated DNA that hybridizes to DNA of (a) aboveunder stringent conditions in which said isolated DNA does not hybridizeto DNA having a nucleotide sequence of SEQ ID NO: 1, and encodes anematode guanylyl cyclase chemoreceptor; and (c) isolated DNA thatdiffers from the DNA of (a) or (b) above due to the degeneracy of thegenetic code, and encodes a nematode guanylyl cyclase chemoreceptorencoded by (a) or (b) above.

[0007] In another particular embodiment of the invention, the isolatedDNA is selected from the group consisting of: (a) isolated DNA having anucleotide sequence selected from the group consisting of SEQ ID NO: 1and SEQ ID NO: 3; (b) isolated DNA that hybridizes to DNA of (a) aboveunder stringent conditions and encodes a nematode guanylyl cyclasechemoreceptor; and (c) isolated DNA that differs from the DNA of (a) or(b) above due to the degeneracy of the genetic code, and encodes anematode guanylyl cyclase chemoreceptor encoded by (a) or (b) above.

[0008] In another particular embodiment of the invention, the isolatedDNA is selected from the group consisting of: (a) isolated DNA having anucleotide sequence selected from the group consisting of SEQ ID NO: 4;(b) isolated DNA that hybridizes to DNA of (a) above under stringentconditions and encodes a nematode guanylyl cyclase chemoreceptor; and(c) isolated DNA that differs from the DNA of (a) or (b) above due tothe degeneracy of the genetic code, and encodes a nematode guanylylcyclase chemoreceptor encoded by (a) or (b) above.

[0009] In still another particular embodiment of the invention, theisolated DNA is selected from the group consisting of: (a) isolated DNAhaving a nucleotide sequence selected from the group consisting of SEQID NO: 6; (b) isolated DNA that hybridizes to DNA of (a) above understringent conditions and encodes a nematode guanylyl cyclasechemoreceptor; and (c) isolated DNA that differs from the DNA of (a) or(b) above due to the degeneracy of the genetic code, and encodes anematode guanylyl cyclase chemoreceptor encoded by (a) or (b) above.

[0010] Preferably, the encoded nematode guanylyl cyclase chemoreceptoris selected from the group consisting of order Tylenchida and orderAphelenchida chemoreceptors.

[0011] A second aspect of the invention is an oligonucleotide thatspecifically binds to isolated DNA as described above is a furtheraspect of the invention. Such an oligonucleotide may comprise DNA orRNA, or may be a synthetic oligonucleotide.

[0012] A third aspect of the invention is an antisense oligonucleotidethat specifically binds to an mRNA transcript of a DNA as describedabove, along with DNAs that encode such antisense oligonucleotides.

[0013] A fourth aspect of the invention double-stranded RNA that iscomplementary to a DNA as described above and interferes with theexpression thereof in a cell that expresses the encoded protein.

[0014] A fifth aspect of the invention is an expression cassettecomprising a DNA as described above and a heterologous promoteroperatively associated therewith, along with cells that contain suchexpression cassettes and express the encoded nematode guanylyl cyclasechemoreceptor (e.g., yeast cells, plant cells, insect cells).

[0015] A sixth aspect of the invention is an isolated nematode guanylylcyclase chemoreceptor protein encoded by a DNA as described above (aprotein of the invention), along with proteins or peptides (e.g.,antibodies) that specifically bind to such nematode guanylyl cyclasechemoreceptor proteins.

[0016] A seventh aspect of the present invention is a method ofscreening a compound for the ability to disrupt plant parasitic nematodefeeding or chemotaxis, said method comprising: determining whether ornot said compound selectively binds to a nematode guanylyl cyclasechemoreceptor protein encoded by a DNA as described above. The presenceof such binding indicating said compound is useful in disrupting plantparasitic nematode feeding or chemotaxis.

[0017] The foregoing and other objects and aspects of the presentinvention are explained in detail in the specification set forth below.

BRIEF DESCRIPTION OF THE DRAWINGS

[0018]FIG. 1 provides a comparison of the guanylyl cyclase (gcy) domainsof HG-gcy-1, HG-gcy-2, and HG-gcy-3 to various other guanylyl cyclasesand proteins.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0019] Amino acid sequences disclosed herein are presented in the aminoto carboxy direction, from left to right. The amino and carboxy groupsare not presented in the sequence. Nucleotide sequences are presentedherein by single strand only, in the 5′ to 3′ direction, from left toright. Nucleotides and amino acids are represented herein in the mannerrecommended by the IUPAC-IUB Biochemical Nomenclature Commission; or(for amino acids) by three letter code, in accordance with 37 CFR §1.822 and established usage.

[0020] The present invention may be carried out with plant parasitic, orplant feeding, nematodes. That is, chemoreceptors proteins of thepresent invention may be those of such nematodes, and isolated DNA maybe isolated, directly or indirectly, from such nematodes. Examples ofplant parasitic nematodes include, but are not limited to, cystnematodes (Heteroderidae spp.), root knot nematodes (Meloidogyne spp.),lesion nematodes (Pratylenchus spp.), and reniform nematodes(Rotylenchulus spp.). In general, nematodes of the orders Tylenchida andAphelenchida are preferred, particularly nematodes of the orderTylenchida. Nematodes of the Heteroderoidea superfamily are particularlypreferred, the Heterodae family more preferred, and the genus Heteroderamost preferred.

[0021] The production of cloned genes, recombinant DNA, vectors,transformed host cells, proteins and protein fragments by genetic-engineering is discussed in greater detail below. It will beappreciated, however, that the techniques employed in carrying out theinstant invention are well known. See, e.g., U.S. Pat. No. 4,761,371 toBell et al. at Col. 6 line 3 to Col. 9 line 65; U.S. Pat. No. 4,877,729to Clark et al. at Col. 4 line 38 to Col. 7 line 6; U.S. Pat. No.4,912,038 to Schilling at Col. 3 line 26 to Col. 14 line 12; and U.S.Pat. No. 4,879,224 to Wallner at Col. 6 line 8 to Col. 8 line 59.(Applicant specifically intends that the disclosure of all patentreferences cited herein be incorporated herein in their entirety byreference).

[0022] 1. Isolated Nucleic Acids.

[0023] Isolated DNA of the present invention can be of any species oforigin, but is preferably isolated either directly or indirectly fromplant parasitic nematodes as described above. Thus, polynucleotides thathybridize to DNA disclosed herein as SEQ ID NO:1, SEQ ID NO: 3, SEQ IDNO: 4, and/or SEQ ID NO: 6 (or fragments or derivatives thereof whichserve as hybridization probes as discussed below) and which code onexpression for a protein of the present invention (e.g., a proteinaccording to SEQ ID NO:2), are also an aspect of the present invention.

[0024] Conditions which will permit other polynucleotides that code onexpression for a protein of the present invention to hybridize to theaforesaid DNA can be determined in accordance with known techniques. Forexample, hybridization of such sequences may be carried out underconditions of reduced stringency, medium stringency or even stringentconditions (e.g., conditions represented by a wash stringency of 35-40%Formamide with 5× Denhardt's solution, 0.5% SDS and 1×SSPE at 37° C.;conditions represented by a wash stringency of 40-45% Formamide with 5×Denhardt's solution, 0.5% SDS, and 1×SSPE at 42° C.; and conditionsrepresented by a wash stringency of 50% Formamide with 5× Denhardt'ssolution, 0.5% SDS and 1×SSPE at 42° C., respectively) to the aforesaidDNA in a standard hybridization assay. See, e.g., J. Sambrook et al.,Molecular Cloning, A Laboratory Manual (2d Ed. 1989) (Cold Spring HarborLaboratory). In general, sequences which code for proteins of thepresent invention and which hybridize to the DNAs disclosed herein willbe at least 60% homologous, 70% homologous, 80% homologous and even 90%homologous or more with SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO: 4, or SEQID NO: 6, respectively.

[0025] As noted above, in one particular embodiment of the invention,the isolated DNA is selected from the group consisting of: (a) isolatedDNA having a nucleotide sequence selected from the group consisting ofSEQ ID NO: 4 and SEQ ID NO: 6; (b) isolated DNA that hybridizes to DNAof (a) above under stringent conditions in which said isolated DNA doesnot hybridize to DNA having a nucleotide sequence of SEQ ID NO: 1, andencodes a nematode guanylyl cyclase chemoreceptor; and (c) isolated DNAthat differs from the DNA of (a) or (b) above due to the degeneracy ofthe genetic code, and encodes a nematode guanylyl cyclase chemoreceptorencoded by (a) or (b) above. In this embodiment, the stringency of thewash conditions is routinely determined by adjusting wash stringencyupward until the stringency meets the test of excluding hybridization toa nucleotide sequence of SEQ ID NO: 1.

[0026] Further, polynucleotides that code for proteins of the presentinvention, or polynucleotides that hybridize to nucleotides having asequence as given in SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO: 4, and/or SEQID NO: 6, as described above, but which differ in codon sequence fromthe given coding sequence due to the degeneracy of the genetic code, arealso an aspect of this invention. The degeneracy of the genetic code,which allows different nucleic acid sequences to code for the sameprotein or peptide, is well known in the literature. See, e.g., U.S.Pat. No. 4,757,006 to Toole et al. at Col. 2, Table 1.

[0027] The invention also encompasses production of DNA sequences, orfragments thereof, which encode proteins of the invention, entirely bysynthetic chemistry. Where modeled after the natural protein or DNA,such sequences may be considered to have been indirectly isolated fromthe species carrying the naturally occurring nucleotide or protein.After production, the synthetic sequence may be inserted into any of themany available expression vectors and cell systems using reagents thatare well known in the art.

[0028] In general, those skilled in the art will appreciate that minordeletions or substitutions may be made to the amino acid sequences ofpeptides of the present invention without unduly adversely affecting theactivity thereof. Thus, peptides containing such deletions orsubstitutions are a further aspect of the present invention. In peptidescontaining substitutions or replacements of amino acids, one or moreamino acids of a peptide sequence may be replaced by one or more otheramino acids wherein such replacement does not affect the function ofthat sequence. Such changes can be guided by known similarities betweenamino acids in physical features such as charge density,hydrophobicity/hydrophilicity, size and configuration, so that aminoacids are substituted with other amino acids having essentially the samefunctional properties.

[0029] 2. Oligonucleotides.

[0030] The term “oligonucleotide” refers to a nucleic acid sequence ofat least about 6 nucleotides to about 60 nucleotides, preferably about15 to 30 nucleotides, and more preferably about −20 to 25 nucleotides,which can be used in PCR amplification or a hybridization assay, or amicroarray. As used herein, oligonucleotide includes “amplifiers”,“primers”, “oligomers”, and “probes”, as commonly defined in the art.

[0031] Knowledge of the nucleotide sequences disclosed herein can beused to generate hybridization probes which specifically bind to the DNAof the present invention or to mRNA produced by the transcription ofsuch nucleotides to determine the presence of, amplify, or determine theoverexpression of the proteins of the present invention. Theoligonucleotides may also be used as active agents for the control ofplant feeding nematodes as described above.

[0032] A label or detectable group may be conjugated to theoligonucleotide, if desired. A wide variety of labels and conjugationtechniques are known by those skilled in the art and may be used invarious nucleic acid and amino acid assays. Means for producing labeledhybridization or PCR probes for detecting sequences related to proteinsof the invention oligolabeling, nick translation, end-labeling or PCRamplification using a labeled nucleotide. Alternatively, the sequencesencoding proteins of the invention, or any fragments thereof may becloned into a vector for the production of an mRNA probe. Such vectorsare known in the art, are commercially available, and may be used tosynthesize RNA probes in vitro by addition of an appropriate RNApolymerase such as T7, T3, or SP6 and labeled nucleotides. Theseprocedures may be conducted using a variety of commercially availablekits (Pharmacia & Upjohn, (Kalamazoo, Mich.); Promega (Madison Wis.);and U.S. Biochemical Corp., Cleveland, Ohio)). Suitable reportermolecules or labels, which may be used for ease of detection, includeradionuclides, enzymes, fluorescent, chemiluminescent, or chromogenicagents as well as substrates, cofactors, inhibitors, magnetic particles,and the like.

[0033] Assays for detecting the nucleotides of the invention, or theextent of amplification thereof, typically involve, first, contactingthe cells or extracts of the cells containing nucleic acids therefromwith an oligonucleotide that specifically binds to proteins of theinvention under conditions that permit access of the oligonucleotide tointracellular material, and then detecting the presence or absence ofbinding of the oligonucleotide thereto. Any suitable assay format may beemployed (see, e.g., U.S. Pat. No. 4,358,535 to Falkow et al.; U.S. Pat.Nos. 4,302,204 to Wahl et al.; 4,994,373 to Stavrianopoulos et al;4,486,539 to Ranki et al.; 4,563,419 to Ranki et al.; and 4,868,104 toKurn et al.) (the disclosures of which applicant specifically intends beincorporated herein by reference).

[0034] 3. Expression Vectors and Transgenic Cell Lines.

[0035] A vector is a replicable DNA construct. Vectors are used hereineither to amplify DNA encoding the proteins of the present invention orto express the proteins of the present invention. An expression vectoris a replicable DNA construct in which a DNA sequence encoding theproteins of the present invention is operably linked to suitable controlsequences capable of effecting the expression of proteins of the presentinvention in a suitable host. The need for such control sequences willvary depending upon the host selected and the transformation methodchosen. Generally, control sequences include a transcriptional promoter,an optional operator sequence to control transcription, a sequenceencoding suitable mRNA ribosomal binding sites, and sequences whichcontrol the termination of transcription and translation. Amplificationvectors do not require expression control domains. All that is needed isthe ability to replicate in a host, usually conferred by an origin ofreplication, and a selection gene to facilitate recognition oftransformants.

[0036] Vectors comprise plasmids, viruses (e.g., adenovirus,cytomegalovirus), phage, retroviruses and integratable DNA fragments(i.e., fragments integratable into the host genome by recombination).The vector replicates and functions independently of the host genome, ormay, in some instances, integrate into the genome itself. Expressionvectors should contain a promoter and RNA binding sites which areoperably linked to the gene to be expressed and are operable in the hostorganism.

[0037] DNA regions are operably linked or operably associated when theyare functionally related to each other. For example, a promoter isoperably linked to a coding sequence if it controls the transcription ofthe sequence; a ribosome binding site is operably linked to a codingsequence if it is positioned so as to permit translation. Generally,operably linked means contiguous and, in the case of leader sequences,contiguous and in reading phase.

[0038] Transformed host cells are cells which have been transformed ortransfected with vectors containing DNA coding for proteins of thepresent invention need not express protein.

[0039] Suitable host cells include prokaryotes, yeast cells, or highereukaryotic organism cells. Prokaryote host cells include gram negativeor gram positive organisms, for example Escherichia coli (E. coli) orBacilli. Higher eukaryotic cells include established cell lines ofmammalian origin as described below. Exemplary host cells are E. coliW3110 (ATCC 27,325), E. coli B, E. coli X1776 (ATCC 31,537), E. coli 294(ATCC 31,446). A broad variety of suitable prokaryotic and microbialvectors are available. E. coli is typically transformed using pBR322.See Bolivar et al., Gene 2, 95 (1977). Promoters most commonly used inrecombinant microbial expression vectors include the beta-lactamase(penicillinase) and lactose promoter systems (Chang et al., Nature 275,615 (1978); and Goeddel et al., Nature 281, 544 (1979), a tryptophan(trp) promoter system (Goeddel et al., Nucleic Acids Res. 8, 4057 (1980)and EPO App. Publ. No. 36,776) and the tac promoter (H. De Boer et al.,Proc. Natl. Acad. Sci. USA 80, 21 (1983). The promoter andShine-Dalgarno sequence (for prokaryotic host expression) are operablylinked to the DNA of the present invention, i.e., they are positioned soas to promote transcription of the messenger RNA from the DNA.

[0040] Expression vectors should contain a promoter which is recognizedby the host organism. This generally means a promoter obtained from theintended host. Promoters most commonly used in recombinant microbialexpression vectors include the beta-lactamase (penicillinase) andlactose promoter systems (Chang et al., Nature 275, 615 (1978); andGoeddel et al., Nature 281, 544 (1979), a tryptophan (trp) promotersystem (Goeddel et al., Nucleic Acids Res. 8, 4057 (1980) and EPO App.Publ. No. 36,776) and the tac promoter (H. De Boer et al., Proc. Natl.Acad. Sci. USA 80, 21 (1983). While these are commonly used, othermicrobial promoters are suitable. Details concerning nucleotidesequences of many have been published, enabling a skilled worker tooperably ligate them to DNA encoding the protein in plasmid or viralvectors (Siebenlist et al., Cell 20, 269 (1980). The promoter andShine-Dalgarno sequence (for prokaryotic host expression) are operablylinked to the DNA encoding the desired protein, i.e., they arepositioned so as to promote transcription of the protein messenger RNAfrom the DNA.

[0041] Eukaryotic microbes such as yeast cultures may be transformedwith suitable protein-encoding vectors. See e.g., U.S. Pat. No.4,745,057. Saccharomyces cerevisiae is the most commonly used amonglower eukaryotic host microorganisms, although a number of other strainsare commonly available. Yeast vectors may contain an origin ofreplication from the 2 micron yeast plasmid or an autonomouslyreplicating sequence (ARS), a promoter, DNA encoding the desiredprotein, sequences for polyadenylation and transcription termination,and a selection gene. An exemplary plasmid is YRp7, (Stinchcomb et al.,Nature 282, 39 (1979); Kingsman et al., Gene 7, 141 (1979); Tschemper etal., Gene 10, 157 (1980). This plasmid contains the trp1 gene, whichprovides a selection marker for a mutant strain of yeast lacking theability to grow in tryptophan, for example ATCC No. 44076 or PEP4-1(Jones, Genetics 85, 12 (1977). The presence of the trp1 lesion in theyeast host cell genome then provides an effective environment fordetecting transformation by growth in the absence of tryptophan.

[0042] Suitable promoting sequences in yeast vectors include thepromoters for metallothionein, 3-phospho-glycerate kinase (Hitzeman etal., J. Biol. Chem. 255, 2073 (1980) or other glycolytic enzymes (Hesset al., J. Adv. Enzyme Reg. 7, 149 (1968); and Holland et al.,Biochemistry 17, 4900 (1978), such as enolase,glyceraldehyde-3-phosphate dehydrogenase, hexokinase, pyruvatedecarboxylase, phosphofructokinase, glucose-6-phosphate isomerase,3-phosphoglycerate mutase, pyruvate kinase, triosephosphate isomerase,phosphoglucose isomerase, and glucokinase. Suitable vectors andpromoters for use in yeast expression are further described in R.Hitzeman et al., EPO Publn. No. 73,657.

[0043] Cultures of cells derived from multicellular organisms are adesirable host for recombinant protein synthesis. In principal, anyhigher eukaryotic cell culture is workable, whether from vertebrate orinvertebrate culture, including insect cells. Propagation of such cellsin cell culture has become a routine procedure. See Tissue Culture,Academic Press, Kruse and Patterson, editors (1973). Examples of usefulhost cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO)cell lines, and WI138, BHK, COS-7, CV, and MDCK cell lines. Expressionvectors for such cells ordinarily include (if necessary) an origin ofreplication, a promoter located upstream from the gene to be expressed,along with a ribosome binding site, RNA splice site (ifintron-containing genomic DNA is used), a polyadenylation site, and atranscriptional termination sequence.

[0044] The transcriptional and translational control sequences inexpression vectors to be used in transforming vertebrate cells are oftenprovided by viral sources. For example, commonly used promoters arederived from polyoma, Adenovirus 2, and Simian Virus 40 (SV40). See,e.g., U.S. Pat. No. 4,599,308. The early and late promoters are usefulbecause both are obtained easily from the virus as a fragment which alsocontains the SV40 viral origin of replication. See Fiers et al., Nature273, 113 (1978). Further, the protein promoter, control and/or signalsequences, may also be used, provided such control sequences arecompatible with the host cell chosen.

[0045] An origin of replication may be provided either by constructionof the vector to include an exogenous origin, such as may be derivedfrom SV40 or other viral source (e.g. Polyoma, Adenovirus, VSV, or BPV),or may be provided by the host cell chromosomal replication mechanism.If the vector is integrated into the host cell chromosome, the lattermay be sufficient.

[0046] Host cells such as insect cells (e.g., cultured Spodopterafrugiperda cells) and expression vectors such as the baculorivusexpression vector (e.g., vectors derived from Autographa californicaMNPV, Trichoplusia ni MNPV, Rachiplusia ou MNPV, or Galleria ou MNPV)may be employed to make proteins usefull in carrying out the presentinvention, as described in U.S. Pat. Nos. 4,745,051 and 4,879,236 toSmith et al. In general, a baculovirus expression vector comprises abaculovirus genome containing the gene to be expressed inserted into thepolyhedrin gene at a position ranging from the polyhedrintranscriptional start signal to the ATG start site and under thetranscriptional control of a baculovirus polyhedrin promoter.

[0047] In mammalian host cells, a number of viral-based expressionsystems may be utilized. In cases where an adenovirus is used as anexpression vector, sequences encoding proteins of the invention may beligated into an adenovirus transcription/translation complex consistingof the late promoter and tripartite leader sequence. Insertion in anon-essential E1 or E3 region of the viral genome may be used to obtaina viable virus which is capable of expressing proteins of the inventionin infected host cells (Logan, J. and Shenk, T. (1984) Proc. Natl. Acad.Sci. 81:3655-3659). In addition, transcription enhancers, such as theRous sarcoma virus (RSV) enhancer, may be used to increase expression inmammalian host cells.

[0048] Rather than using vectors which contain viral origins ofreplication, one can transform mammalian cells by the method ofcotransformation with a selectable marker and the chimeric protein DNA.An example of a suitable selectable marker is dihydrofolate reductase(DHFR) or thymidine kinase. See U.S. Pat. No. 4,399,216. Such markersare proteins, generally enzymes, that enable the identification oftransformant cells, i.e., cells which are competent to take up exogenousDNA. Generally, identification is by survival or transformants inculture medium that is toxic, or from which the cells cannot obtaincritical nutrition without having taken up the marker protein.

[0049] 4. Antibodies and Other Binding Proteins and Peptides.

[0050] As used herein, the term “antibody” refers to intact molecules aswell as fragments thereof, such as Fa, F(ab′)₂, and Fc, which arecapable of binding the epitopic determinant. Antibodies that bindproteins of the invention can be prepared using intact proteins orfragments containing small peptides of interest as the immunizingantigen. The polypeptide or oligopeptide used to immunize an animal canbe derived from the translation of RNA or synthesized chemically and canbe conjugated to a carrier protein, if desired. Commonly used carriersthat are chemically coupled to peptides include bovine serum albumin andthyroglobulin, keyhole limpet hemocyanin. The coupled peptide is thenused to immunize the animal (e.g., a mouse, a rat, or a rabbit) fromwhich antibodies or spleen cells are collected.

[0051] Antibodies that specifically bind to the proteins of the presentinvention (i.e., antibodies which bind to a single antigenic site orepitope on the proteins) are useful for a variety of diagnosticpurposes.

[0052] Antibodies to proteins of the invention may be generated usingmethods that are well known in the art. Such antibodies may include, butare not limited to, polyclonal, monoclonal, chimeric, single chain, Fabfragments, and fragments produced by a Fab expression library.Neutralizing antibodies, (i.e., those which inhibit dimer formation) areespecially preferred for therapeutic use.

[0053] For the production of antibodies, various hosts including goats,rabbits, rats, mice, humans, and others, may be immunized by injectionwith a protein of the invention or any fragment or oligopeptide thereofwhich has immunogenic properties. Depending on the host species, variousadjuvants may be used to increase immunological response. Such adjuvantsinclude, but are not limited to, Freund's, mineral gels such as aluminumhydroxide, and surface active substances such as lysolecithin, pluronicpolyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin,and dinitrophenol. Among adjuvants used in humans, BCG (bacilliCalmette-Guerin) and Corynebacterium parvum are especially preferable.

[0054] Monoclonal antibodies to proteins of the invention may beprepared using any technique which provides for the production ofantibody molecules by continuous cell lines in culture. These include,but are not limited to, the hybridoma technique, the human B-cellhybridoma technique, and the EBV-hybridoma technique (Kohler, G. et al.(1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods81:31-42; Cote, R. J. et al. (1983) Proc. Natl. Acad. Sci. 80:2026-2030;Cole, S. P. et al. (1984) Mol. Cell Biol. 62:109-120).

[0055] Antibodies may also be produced by inducing in vivo production inthe lymphocyte population or by screening immunoglobulin libraries orpanels of highly specific binding reagents as disclosed in theliterature (Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. 86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299).

[0056] Antibody fragments which contain specific binding sites forproteins of the invention may also be generated. For example, suchfragments include, but are not limited to, the F(ab′)₂ fragments whichcan be produced by pepsin digestion of the antibody molecule and the Fabfragments which can be generated by reducing the disulfide bridges ofthe F(ab′)₂ fragments. Alternatively, Fab expression libraries may beconstructed to allow rapid and easy identification of monoclonal Fabfragments with the desired specificity (Huse, W. D. et al. (1989)Science 254:1275-1281).

[0057] Various immunoassays may be used for screening to identifyantibodies having the desired specificity. Numerous protocols forcompetitive binding or immunoradiometric assays using either polyclonalor monoclonal antibodies with established specificities are well knownin the art. Such immunoassays typically involve the measurement ofcomplex formation between a protein of the invention and its specificantibody.

[0058] Antibodies may be conjugated to a solid support suitable for adiagnostic assay (e.g., beads, plates, slides or wells formed frommaterials such as latex or polystyrene) in accordance with knowntechniques, such as precipitation. Antibodies may likewise be conjugatedto detectable groups such as radiolabels (e.g., ³⁵S, ¹²⁵I, ¹³¹I, enzymelabels (e.g., horseradish peroxidase, alkaline phosphatase), andfluorescent labels (e.g., fluorescein) in accordance with knowntechniques.

[0059] Kits for determining if a sample contains proteins of the presentinvention will include at least one reagent specific for detecting thepresence or absence of the protein. Diagnostic kits for carrying outantibody assays may be produced in a number of ways. In one embodiment,the diagnostic kit comprises (a) an antibody which binds proteins of thepresent invention conjugated to a solid support and (b) a secondantibody which binds proteins of the present invention conjugated to adetectable group. The reagents may also include ancillary agents such asbuffering agents and protein stabilizing agents, e.g., polysaccharidesand the like. The diagnostic kit may further include, where necessary,other members of the signal-producing system of which system thedetectable group is a member (e.g., enzyme substrates), agents forreducing background interference in a test, control reagents, apparatusfor conducting a test, and the like. A second embodiment of a test kitcomprises (a) an antibody as above, and (b) a specific binding partnerfor the -antibody conjugated to a detectable group. Ancillary agents asdescribed above may likewise be included. The test kit may be packagedin any suitable manner, typically with all elements in a singlecontainer along with a sheet of printed instructions for carrying outthe test.

[0060] Binding proteins or peptides other than antibodies, and bindingcompounds other than proteins or peptides, can be identified byscreening combinatorial libraries of such compounds with the screeningassays described below.

[0061] 5. Screening Assays.

[0062] As noted above, the present invention provides methods ofscreening a compound for the ability to disrupt plant parasitic nematodefeeding and/or chemotaxis. The methods comprise determining whether ornot that compound selectively binds to a nematode guanylyl cyclasechemoreceptor protein encoded by a DNA as described herein. The presenceof such binding indicates the compound is useful in disrupting plantparasitic nematode feeding and/or chemotaxis. The determining step maybe carried out in vitro with a cell membrane preparation containing theproteins produced from recombinant cells as described above, or even ina membrane-free preparation. Alternatively, the determining step may becarried out in vivo in a cell culture comprising cells that express theprotein, in accordance with known techniques.

[0063] The compound screened may be a member of a combinatorial library,which generally are comprised of non-oligomers, oligomers, orcombinations thereof. Non-oligomer combinatorial libraries include awide variety of organic molecules, such as heterocyclics, aromatics,alicyclics, aliphatics and combinations thereof, comprising steroids,antibiotics, enzyme inhibitors, ligands, hormones, drugs, alkaloids,opioids, terpenes, porphyrins, toxins, catalysts, as well ascombinations thereof.

[0064] Oligomer combinatorial libraries include oligopeptides,oligonucleotides, oligosaccharides, polylipids, polyesters, polyamides,polyurethanes, polyureas, polyethers, and poly (phosphorus derivatives),e.g. phosphates, phosphonates, phosphoramides, phosphonamides,phosphites, phosphinamides, etc., poly (sulfur derivatives) e.g.,sulfones, sulfonates, sulfites, sulfonamides, sulfenamides, etc., wherefor the phosphorous and sulfur derivatives the indicated heteroatom forthe most part will be bonded to C,H,N,O or S, and combinations thereof.

[0065] When the compound to be screened is a member of a combinatoriallibrary, the screening step may be incorporated into a high throughputscreening procedure in accordance with known techniques. In this case,the members of the combinatorial library may be immobilized on solidsupports, which solid supports may be separate from one another (e.g.,particles or beads) as described in U.S. Pat. No. 5,656,324 to Still etal., or may be discrete regions on a surface portion of a unitarysubstrate. Such “chip-type” or “pin-type” solid supports are known. See,e.g., U.S. Pat. No. 5,288,514 to Ellman (pin-based support); U.S. Pat.No. 5,510,270 to Fodor et al. (chip-based support). In addition, thescreening step may be carried out with any other suitable combinatoriallibrary technique, including but not limited to phage display. See,e.g., U.S. Pat. No. 5,812,047 to Garrard et al.; U.S. Pat. No. 5,223,409to Ladner et al.; U.S. Pat. No. 5,498,538 to Kay & Fowlkes; U.S. Pat.No. 4,953,002 to Dulbecco.

[0066] One of the principal assays to determine efficacy of potentialinhibitors of chemosensory receptors will be to transform plant cellsand tissues with genes encoding chemosensory inhibitor molecules (likethose mentioned in this application, preferably peptides) and test theireffect on nematode chemotaxis. Expression of cassettes producing encodedinhibitory molecules that may be retained, or preferably designed to beexuded from plant tissues (i.e. roots), may be under the control ofconstitutive promoters such as CaMV 35S, Ro1A-D, nopaline synthase,gamma-TIP, T-cyt, and TR2′, or inducible promoters such as those derivedfrom expressed plant genes like TobRB7, cdc2At, and wun1 (H. Atkinson etal., in The Physiology and Biochemistry of Free-Living andPlant-Parasitic Nematodes, pp. 382-413 (ed. R N Perry, D J Wright,1998); G. Gheysen et al., in Cellular and Molecular Aspects ofPlant-Nematode Interactions, pp. 120-132 (ed. C Fenoll, F M W Grundler,S A Ohl, 1997); A. Goverse et al., Physiol. Mol. Plant Pathol.52:275-284 (1998)). Transformation of plant cells and tissues may beconducted using Agrobacterium tumefascians, Agrobacterium rhizogenes, orbiolistic approaches (Atkinson et al., 1998). Transformation of plantroots via A. rhizogenes is preferable for screening purposes since thesesystems work in many plant species and have been demonstrated to producea reproducible and scorable plant-nematode interaction (D. Cai et al.,Science 275, 832-834 (1997); M. Savka et al., Phytopathology 80, 503-508(1990)). A variety of agar and soil-based assays may be utilized toassess the effect of transgenic expression of chemosensory inhibitors inplants on nematode chemotactic ability (C. Bargmann and I. Mori.Chemotaxis and thermotaxis. Pp. 717-737 In Riddle, D. L., T. Blumenthal,B. J. Meyer, and J. R. Priess, eds., C. elegans II, (Cold Spring HarborLaboratory Press, NY 1997)).

[0067] 6. Antisense Oligonucleotides and Double-stranded RNA.

[0068] Antisense oligonucleotides. The term “antisense”, as used herein,refers to any composition containing nucleotide sequences which arecomplementary to a specific DNA or RNA sequence. The term “antisensestrand” is used in reference to a nucleic acid strand that iscomplementary to the “sense” strand. Antisense molecules include peptidenucleic acids and may be produced by any method including synthesis ortranscription. Once introduced into a cell, the complementarynucleotides combine with natural sequences produced by the cell to formduplexes and block either transcription or translation. The designation“negative” is sometimes used in reference to the antisense strand, and“positive” is sometimes used in reference to the sense strand.

[0069] Antisense oligonucleotides and nucleic acids that express thesame may be made in accordance with conventional techniques. See, e.g.,U.S. Pat. No. 5,023,243 to Tullis; U.S. Pat. No. 5,149,797 to Pedersonet al. The length of the antisense oligonucleotide (i.e., the number ofnucleotides therein) is not critical so long as it binds selectively tothe intended location, and can be determined in accordance with routineprocedures. In general, the antisense oligonucleotide will be from 8, 10or 12 nucleotides in length up to 20, 30, or 50 nucleotides in length.Such antisense oligonucleotides may be oligonucleotides wherein at leastone, or all, or the internucleotide bridging phosphate residues aremodified phosphates, such as methyl phosphonates, methylphosphonothioates, phosphoromorpholidates, phosphoro-piperazidates andphosphoramidates. For example, every other one of the internucleotidebridging phosphate residues may be modified as described. In anothernon-limiting example, such antisense oligonucleotides areoligonucleotides wherein at least one, or all, of the nucleotidescontain a 2′ loweralkyl moiety (e.g., C₁-C₄, linear or branched,saturated or unsaturated alkyl, such as methyl, ethyl, ethenyl, propyl,1-propenyl, 2-propenyl, and isopropyl). For example, every other one ofthe nucleotides may be modified as described. See also P. Furdon et al.,Nucleic Acids Res. 17, 9193-9204 (1989); S. Agrawal et al., Proc. Natl.Acad Sci. USA 87, 1401-1405 (1990); C.

[0070] Baker et al., Nucleic Acids Res. 18, 3537-3543 (1990); B. Sproatet al., Nucleic Acids Res. 17, 3373-3386 (1989); R. Walder and J.Walder, Proc. Natl. Acad. Sci. USA 85, 5011-5015 (1988).

[0071] Antisense oligonucleotides may be used as biological controlagents per se, or DNA encoding such antisense oligonucleotides may beprovided in an expression cassette which is capable of infecting a hostnematode and transforming cells of the same, which expression cassettemay in turn be used as a biological control agent.

[0072] RNA interference (RNAi). RNAi is a methodology to directlyinhibit gene activity that is both powerful and efficient is thedouble-stranded (ds) RNA-mediated interference (RNAi) of gene expressionas demonstrated in C. elegans (A. Fire et al., Potent and specificgenetic interference by double-stranded RNA in Caenorhabditis elegans,Nature 391, 806-11 (1998); L. Timmons and A. Fire, Specific interferenceby ingested dsRNA. Nature 395, 854 (1998)). In this methodology, dsRNAcomplementary to a gene-of-interest (in this case, a DNA as describedabove) is administered to (e.g., injected into or ingested by) thetarget nematode. As a consequence, activity of the gene-of-interest istransiently abolished in the treated animal. Thus such agents are usefulin the control of nematodes as described above. Two distinct advantagesprovided by RNAi analyses include; the dsRNA does not have to beinjected into the nematode germ line to exert inhibitory effects intissues distal to the injection site (i. e. RNAi does not requiresuccessful transformation); and the inhibitory effects of injected dsRNAcan be realized in one or more subsequent nematode generations derivedfrom the treated parent. RNAi designed to knock-out gene function innematodes can be assayed directly for its effects on chemosensorybehavior. It also will be important to monitor the effects RNAi by mRNAin situ hybridization and/or antibody probes to the target gene productto confirm inhibition. The dsRNA is, in general, from 8, 10 or 12nucleotides in length up to 20, 30, or 50 nucleotides in length, eachstrand (although the strands do not have to be identical in length).

[0073] 7. Control of Plant Parasitic Nematodes.

[0074] The present invention provides a variety of means for controllingplant parasitic nematodes, as described above.

[0075] Nematodes may be administered an expression cassette thatcontains and expresses a DNA as described thereof, or a fragment thereofof a length sufficient to induce silencing (e.g., at least partialsilencing) of expression of the guanylyl cyclase chemoreceptor in thenematode, and thereby disrupt feeding or chemotaxis of the nematode.

[0076] Nematodes may be administered an antisense oligonucleotide asdescribed above, either per se or through a vector that expresses theantisense oligonucleotide, in an amount sufficient to disrupt feeding orchemotaxis of the nematode.

[0077] Nematodes may be administered an oligonucleotide as describedabove (such as an RNAi oligonucleotide as described above) in an amountsufficient to disrupt feeding or chemotaxis of the nematode.

[0078] Nematodes may be administered a protein or peptide (e.g., anantibody) that specifically binds to the guanylyl cyclase chemoreceptorproteins disclosed hereinabove, in an amount sufficient to disrupt thefeeding or chemotaxis of the nematode. Such proteins and peptides,including antibodies, are readily produced in the manner describedabove.

[0079] Administration of the active compounds described above may be byany suitable means, such as by spraying crops or plants with the activeagents described above, by treating soil with the active agentsdescribed above, etc. The active agents may be combined with a suitableagricultural carrier, including aqueous carriers, nonaqueous carriers,emulsions, dry powders, etc., which may optionally include stickers,adjuvants and the like, all in accordance with standard techniques, withthe active agent being included in any suitable amount (e.g., from 0.001to 99 percent by weight of the total composition).

[0080] The present invention is explained in greater detail in thefollowing non-limiting Examples.

EXAMPLE 1 Identification of HG-gcy-1

[0081] The soybean cyst nematode (SCN), Heterodera glycines guanylylcyclase −1 (HG-gcy-1) coding sequence was first located by the instantinventors about 1 kb upstream of β-1,4-endoglucanase-1 precursor duringa study of the organization of β-1,4-endoglucanase gene family in thesoybean cyst nematode. The full-length chemosensory guanylyl cyclasegene HG-gcy-1 was generated by further Lambda genomic clone mapping ,oligo(dT) cDNA library screening and 5′RACE. A partial cDNA clone ofHg-gcy-1 was obtained by screening a SCN cDNA library and the partialHG-gcy-1 cDNA sequence was released in GenBank on Nov. 9, 1998 (GenBankaccession number AF095746). The HG-gcy-1 full-length cDNA sequence (3762bp) (SEQ ID NO: 1) was released in GenBank on Mar. 5, 1999 with anaccession number (AF095746).

[0082] By comparing the HG-gcy-1 genomic sequence (SEQ ID NO: 3) andcDNA sequence, 24 introns were identified in HG-gcy-1 gene. Thepredicted protein of HG-gcy-1 (SEQ ID NO: 2) had strong homology to afamily of guanylyl cyclase chemoreceptors reported in the nematodeCaenorhabditis elegans. We have recently localized transcripts ofHG-gcy-1 in SCN chemosensory cells by mRNA in situ hybridization (datanot reported).

EXAMPLE 2 Identification of HG-gcy-2 and HG-gcy-3

[0083] A DNA DIG labeled probe was synthesized based on the guanylylcyclase catalytic domain sequence. The primers used to synthesize theprobe was cycleExp3 and cycleExp4. The cycleExp3 primer sequence is:

[0084] AGCGGATCCCGTCCGCGCATGGACATTGTG (SEQ ID NO: 8).

[0085] The cycleExp4 prime sequence is:

[0086] CCGCTCGAGCGTTGCGGCACTCGCATTTCT (SEQ ID NO: 9)

[0087] The thus synthesized DNA probe was used to screen the SCNoligo(dT) cDNA library with both hybridization and washing temperatureat 65° C. This endeavor lead to the identification of two additionalguanylyl cyclase genes, namely HG-gcy-2 (3499 bps) (SEQ ID NO: 4) (theprotein fragment being given as SEQ ID NO: 5) and HG-gcy-3 (3007 bps)(SEQ ID NO: 6) (the protein fragment being given as SEQ ID NO: 7). Thesesequences may be elongated in accordance with known techniques toprovide the full length sequences thereof.

[0088] The foregoing is illustrative of the present invention, and isnot to be construed as limiting thereof. The invention is defined by thefollowing claims, with equivalents of the claims to be included therein.

1 9 1 3762 DNA Heterodera glycines CDS (78)..(3533) 1 gggatttgaatcccgacaaa tcgccttttt aatgaattca tttcatttta aaatttcctt 60 gcccaaaatctctcaaa atg gaa atg ccg tcc tgt ttc ttc ctc ctt ttc 110 Met Glu Met ProSer Cys Phe Phe Leu Leu Phe 1 5 10 ttt ctt atg ctt ttt gtc agc cct tctcgg cac caa tta gtc act gtt 158 Phe Leu Met Leu Phe Val Ser Pro Ser ArgHis Gln Leu Val Thr Val 15 20 25 agc aac tca tcg tct tcg ccc att ggc accacc gtc gct ttt ggc act 206 Ser Asn Ser Ser Ser Ser Pro Ile Gly Thr ThrVal Ala Phe Gly Thr 30 35 40 cct tcg ccg atc atc cca atc tct act gca ccctcc acg aat ggc act 254 Pro Ser Pro Ile Ile Pro Ile Ser Thr Ala Pro SerThr Asn Gly Thr 45 50 55 gcc act ttt ggc gtc cct ccg ccg atc agt ccg ccgatt aat tct tcg 302 Ala Thr Phe Gly Val Pro Pro Pro Ile Ser Pro Pro IleAsn Ser Ser 60 65 70 75 tcc tct ctc cca tca act ggt cct ttg gaa gca tcggtt cag tta aaa 350 Ser Ser Leu Pro Ser Thr Gly Pro Leu Glu Ala Ser ValGln Leu Lys 80 85 90 atc ggc ttc ctc ttt gct aac ggc acc caa cgg ttg cgaatg ctt ttc 398 Ile Gly Phe Leu Phe Ala Asn Gly Thr Gln Arg Leu Arg MetLeu Phe 95 100 105 ggc ttt ggc caa tcc gcg ccc gcc gtc act ttg gca ctcgaa cgg gcg 446 Gly Phe Gly Gln Ser Ala Pro Ala Val Thr Leu Ala Leu GluArg Ala 110 115 120 agg cag gag cac ctc atc gac agc atc aac ttc act tacacg tgg cga 494 Arg Gln Glu His Leu Ile Asp Ser Ile Asn Phe Thr Tyr ThrTrp Arg 125 130 135 atg tgc ggc tgc ttt cag cct tgg gct gtc ggc tac gccact caa ctg 542 Met Cys Gly Cys Phe Gln Pro Trp Ala Val Gly Tyr Ala ThrGln Leu 140 145 150 155 gtt ctg acg gaa aat gtg gac gct ttg atc ggt ccgcct tgt gcc atc 590 Val Leu Thr Glu Asn Val Asp Ala Leu Ile Gly Pro ProCys Ala Ile 160 165 170 gcc gcg gga tac gtg gcc tcc ttc tac aac att ccactg tat ttg tgg 638 Ala Ala Gly Tyr Val Ala Ser Phe Tyr Asn Ile Pro LeuTyr Leu Trp 175 180 185 ggt gct act gtg gcc tcg gaa ttt tac aac act accgta tac cct aca 686 Gly Ala Thr Val Ala Ser Glu Phe Tyr Asn Thr Thr ValTyr Pro Thr 190 195 200 ctg aac aac gtg aac gtt aac tcg gac atg ttg gcgttg gcc tta caa 734 Leu Asn Asn Val Asn Val Asn Ser Asp Met Leu Ala LeuAla Leu Gln 205 210 215 agt gtg ttg gtg caa ttc aat tgg aca gaa gtg tccttc gtg tac act 782 Ser Val Leu Val Gln Phe Asn Trp Thr Glu Val Ser PheVal Tyr Thr 220 225 230 235 ccg gac aat gag cga atg gtc tgt aac tcg gtgaaa cag agt ctc aca 830 Pro Asp Asn Glu Arg Met Val Cys Asn Ser Val LysGln Ser Leu Thr 240 245 250 aat gtg ctc aac gtg acc aat gtg acc att gttttc cag cat cag atg 878 Asn Val Leu Asn Val Thr Asn Val Thr Ile Val PheGln His Gln Met 255 260 265 gag tcc aat gtg gac agt atg aag gcg acg ctgaga aat ctg cgc agc 926 Glu Ser Asn Val Asp Ser Met Lys Ala Thr Leu ArgAsn Leu Arg Ser 270 275 280 cga tcg cga att gtg ctt tcc tgt ttc gat gtcgag gtt gac cgt cgc 974 Arg Ser Arg Ile Val Leu Ser Cys Phe Asp Val GluVal Asp Arg Arg 285 290 295 aac ttt ctg ttg tcc att ttc gac act ggt cttgct gcg gac aac gaa 1022 Asn Phe Leu Leu Ser Ile Phe Asp Thr Gly Leu AlaAla Asp Asn Glu 300 305 310 315 ttt gtg ttc atc atg gga tcc ctg cgc aaccag ggc atg ctc cag cag 1070 Phe Val Phe Ile Met Gly Ser Leu Arg Asn GlnGly Met Leu Gln Gln 320 325 330 gtt gcg tcg cgt gag gac ggc agt gtc aaatat gtg aac aat tgg atg 1118 Val Ala Ser Arg Glu Asp Gly Ser Val Lys TyrVal Asn Asn Trp Met 335 340 345 gac aaa aac agc cca ggc gat ggc cgc gactcg gac gca ctc gcc gcg 1166 Asp Lys Asn Ser Pro Gly Asp Gly Arg Asp SerAsp Ala Leu Ala Ala 350 355 360 aca aaa cac gtc ata atg att gac ctg gaaaac caa tcg agt gat cat 1214 Thr Lys His Val Ile Met Ile Asp Leu Glu AsnGln Ser Ser Asp His 365 370 375 ctt aac gaa ttc aac cga gat ttg agt gcgaaa ttc ggc act tat ccc 1262 Leu Asn Glu Phe Asn Arg Asp Leu Ser Ala LysPhe Gly Thr Tyr Pro 380 385 390 395 ttt ttc tgc aac gga agt tgc atg ggcggc gca gca gaa caa tcg ccg 1310 Phe Phe Cys Asn Gly Ser Cys Met Gly GlyAla Ala Glu Gln Ser Pro 400 405 410 tcg caa tac gcc agg gct ttg ttc gacaca aca tac gca tat ttt aga 1358 Ser Gln Tyr Ala Arg Ala Leu Phe Asp ThrThr Tyr Ala Tyr Phe Arg 415 420 425 gca ttg aat cgc aca atg gaa aag cgcaaa tcg aat ggg agg gat ttg 1406 Ala Leu Asn Arg Thr Met Glu Lys Arg LysSer Asn Gly Arg Asp Leu 430 435 440 ttg cgc aac ggc acg gaa ttg aac gcagaa act gcc ggg acg acc ttt 1454 Leu Arg Asn Gly Thr Glu Leu Asn Ala GluThr Ala Gly Thr Thr Phe 445 450 455 cag ggc gag acc gga cgc atc act tttgac gcc cac ggc aac cgc cag 1502 Gln Gly Glu Thr Gly Arg Ile Thr Phe AspAla His Gly Asn Arg Gln 460 465 470 475 ccg acc ttt ttt gtg acg atg ctaaac gca ctg aat gtg ccc act gtt 1550 Pro Thr Phe Phe Val Thr Met Leu AsnAla Leu Asn Val Pro Thr Val 480 485 490 atg gtg aaa gtg aac att acc aacgga gta ttg aaa atg gaa cgg ctg 1598 Met Val Lys Val Asn Ile Thr Asn GlyVal Leu Lys Met Glu Arg Leu 495 500 505 tac ggc agt gag gcg tcg ctg tgggtc aat tgg ggc ggc ttt cgg ccg 1646 Tyr Gly Ser Glu Ala Ser Leu Trp ValAsn Trp Gly Gly Phe Arg Pro 510 515 520 atg acc acg ccg ttg tgc ggc tacaac ggc aca atg tgt ggc caa aat 1694 Met Thr Thr Pro Leu Cys Gly Tyr AsnGly Thr Met Cys Gly Gln Asn 525 530 535 gtg acg gtg tac att ctg atc ggcgtt acg ctt atg ttg ctg ttg ctg 1742 Val Thr Val Tyr Ile Leu Ile Gly ValThr Leu Met Leu Leu Leu Leu 540 545 550 555 gtc gcc gct ttg ctt ggc atcgga tac gca att cgg gag aaa atg cgc 1790 Val Ala Ala Leu Leu Gly Ile GlyTyr Ala Ile Arg Glu Lys Met Arg 560 565 570 gag aag cag cgc ctg aca cgcgag tgt ttg atc cca ttt gca gag ctg 1838 Glu Lys Gln Arg Leu Thr Arg GluCys Leu Ile Pro Phe Ala Glu Leu 575 580 585 cgc aac ctg aaa gag ctg cgcagt tcg gag gaa ctg aag tcg gag acg 1886 Arg Asn Leu Lys Glu Leu Arg SerSer Glu Glu Leu Lys Ser Glu Thr 590 595 600 gag aag agc atg cgg agc atgcgt agc agt cag tcg gga agc aca cgg 1934 Glu Lys Ser Met Arg Ser Met ArgSer Ser Gln Ser Gly Ser Thr Arg 605 610 615 ctt acg gtc ggc agc cac aaagcg cag cgg gag acg gcc aat tgc gcg 1982 Leu Thr Val Gly Ser His Lys AlaGln Arg Glu Thr Ala Asn Cys Ala 620 625 630 635 ttc ttt gtg ttc aac cgggaa att gtg ctg gcc gtg aaa tac cac gtc 2030 Phe Phe Val Phe Asn Arg GluIle Val Leu Ala Val Lys Tyr His Val 640 645 650 agg gtg cga att atg tccgaa gat ttg gcc ttc att agg aag ctg cgg 2078 Arg Val Arg Ile Met Ser GluAsp Leu Ala Phe Ile Arg Lys Leu Arg 655 660 665 cag ttg gac cac gac aacatg aac aag ttg tac ggc gtg tgc acc gat 2126 Gln Leu Asp His Asp Asn MetAsn Lys Leu Tyr Gly Val Cys Thr Asp 670 675 680 ggg ccc ctt ttg ttc gcaatt tgg cgc aat tgt cag cga ggg aca tta 2174 Gly Pro Leu Leu Phe Ala IleTrp Arg Asn Cys Gln Arg Gly Thr Leu 685 690 695 aaa gaa ctg atc gcc aaggag caa tac gtt ggg gac aat tgt gtg atg 2222 Lys Glu Leu Ile Ala Lys GluGln Tyr Val Gly Asp Asn Cys Val Met 700 705 710 715 ttt gct ctg atg cgggac att gca aat ggt ctg ctc gcc atc cat caa 2270 Phe Ala Leu Met Arg AspIle Ala Asn Gly Leu Leu Ala Ile His Gln 720 725 730 tcg ttc atc gga gcccac ggg ctg ctc tcc tct gaa aat tgt ctg atc 2318 Ser Phe Ile Gly Ala HisGly Leu Leu Ser Ser Glu Asn Cys Leu Ile 735 740 745 aat gac cgg tgg caagtg aaa atc agc gac ttt ggc ctg aat atg atc 2366 Asn Asp Arg Trp Gln ValLys Ile Ser Asp Phe Gly Leu Asn Met Ile 750 755 760 aga gaa agt caa acgctg tcg aag aaa gca ctt ttg tgg acg gcg cct 2414 Arg Glu Ser Gln Thr LeuSer Lys Lys Ala Leu Leu Trp Thr Ala Pro 765 770 775 gaa ctt ttg cga gaaaac aat cgg aag gga gca aaa gag ggc gat gtg 2462 Glu Leu Leu Arg Glu AsnAsn Arg Lys Gly Ala Lys Glu Gly Asp Val 780 785 790 795 ttc agt ttt gcgatc att tgt gtg gaa atg atg aac aga gag acg gtg 2510 Phe Ser Phe Ala IleIle Cys Val Glu Met Met Asn Arg Glu Thr Val 800 805 810 tgg aac gga gtggaa agg gac caa gac atc gat gaa atc ctt tat cgg 2558 Trp Asn Gly Val GluArg Asp Gln Asp Ile Asp Glu Ile Leu Tyr Arg 815 820 825 ctc aga cgc accaac acc aca atc cct cac cgt ccg cag ctt cat ccc 2606 Leu Arg Arg Thr AsnThr Thr Ile Pro His Arg Pro Gln Leu His Pro 830 835 840 cgc gca gag attaac caa agt ttg ctt cat ctg atc aga gac tgt tgg 2654 Arg Ala Glu Ile AsnGln Ser Leu Leu His Leu Ile Arg Asp Cys Trp 845 850 855 tcc gaa gtg ccgtcc gaa cgt ccg cgc atg gac att gtg cga acg atg 2702 Ser Glu Val Pro SerGlu Arg Pro Arg Met Asp Ile Val Arg Thr Met 860 865 870 875 ctc aaa cagatg gtc cag gac ggc agt caa aat ctg atg gat tac gtg 2750 Leu Lys Gln MetVal Gln Asp Gly Ser Gln Asn Leu Met Asp Tyr Val 880 885 890 ttc ggc atgttg gag cag tac gcg agt tcg ctg gag cag gag gtg gag 2798 Phe Gly Met LeuGlu Gln Tyr Ala Ser Ser Leu Glu Gln Glu Val Glu 895 900 905 gaa cgg accaaa gag ttg gtg gag gag aag cgc aag agc gac att ctt 2846 Glu Arg Thr LysGlu Leu Val Glu Glu Lys Arg Lys Ser Asp Ile Leu 910 915 920 ctc tac cggatg ttg ccg cgg cag gtg gcg gac aaa ctg aag ata ggc 2894 Leu Tyr Arg MetLeu Pro Arg Gln Val Ala Asp Lys Leu Lys Ile Gly 925 930 935 gag tct gtggag cca gaa tcc ttc caa atg gcc acc att ttc ttc tcc 2942 Glu Ser Val GluPro Glu Ser Phe Gln Met Ala Thr Ile Phe Phe Ser 940 945 950 955 gac gtcgtc tcc ttc acc act ttg gcc ggc aaa tgc tcg cca ttg caa 2990 Asp Val ValSer Phe Thr Thr Leu Ala Gly Lys Cys Ser Pro Leu Gln 960 965 970 gtt gtgaat ctg ctc aac ggt ctg ttc aca gcc ttt gac ggg atc att 3038 Val Val AsnLeu Leu Asn Gly Leu Phe Thr Ala Phe Asp Gly Ile Ile 975 980 985 gac actcat gac tgc tac aaa gtt gaa acc att ggc gat ggc tat ttg 3086 Asp Thr HisAsp Cys Tyr Lys Val Glu Thr Ile Gly Asp Gly Tyr Leu 990 995 1000 gtc tgttcg ggc att ccg aag cgc aac ggc gac caa cac gcg aaa 3131 Val Cys Ser GlyIle Pro Lys Arg Asn Gly Asp Gln His Ala Lys 1005 1010 1015 gaa ata gccgaa ctt tcg ttc gcc ttc ctt cgc act gtg tcc agc 3176 Glu Ile Ala Glu LeuSer Phe Ala Phe Leu Arg Thr Val Ser Ser 1020 1025 1030 ttc cgt gtc gatcac ctc ccc tcc gaa cgg gtc aac ctt cgc att 3221 Phe Arg Val Asp His LeuPro Ser Glu Arg Val Asn Leu Arg Ile 1035 1040 1045 ggc ttc cat tcc ggacca gcg gtc gct ggc gtc gtc gga ctg aca 3266 Gly Phe His Ser Gly Pro AlaVal Ala Gly Val Val Gly Leu Thr 1050 1055 1060 atg ccg cgc tat tgt ctcttt ggg gac tca gtg aac acg gcc agc 3311 Met Pro Arg Tyr Cys Leu Phe GlyAsp Ser Val Asn Thr Ala Ser 1065 1070 1075 cga atg gag tca aac gga aaggca ggc cga gtg cac att tca tca 3356 Arg Met Glu Ser Asn Gly Lys Ala GlyArg Val His Ile Ser Ser 1080 1085 1090 agt gcc aac cac ttt ttg acc agtgta atc ggc gga tat gtg aca 3401 Ser Ala Asn His Phe Leu Thr Ser Val IleGly Gly Tyr Val Thr 1095 1100 1105 gag cca aga ggc gaa gtg att ata aagggc aaa gga gtg atg gag 3446 Glu Pro Arg Gly Glu Val Ile Ile Lys Gly LysGly Val Met Glu 1110 1115 1120 acc ttt tgg ctg tta ggg cga att gga gaggca cat ttg tcg gag 3491 Thr Phe Trp Leu Leu Gly Arg Ile Gly Glu Ala HisLeu Ser Glu 1125 1130 1135 ggc aca gcg gaa aga aat gcg agt gcc gca acgaga aaa tga 3533 Gly Thr Ala Glu Arg Asn Ala Ser Ala Ala Thr Arg Lys1140 1145 1150 agaaacatca cacggcattc cctctgatca ctcattttaa tgactcgaaatcattgacca 3593 attttaatga attttaatct cttttattat tatgatagcg caatttttgcgcacatttaa 3653 gcgataacaa tttttatatt aaagttcccc ttaacaaatt tactattgtaaatactgtct 3713 cgaatacaaa aaatgtataa tttactatta aaaaaaaaaa aaaaaaaaa3762 2 1151 PRT Heterodera glycines 2 Met Glu Met Pro Ser Cys Phe PheLeu Leu Phe Phe Leu Met Leu Phe 1 5 10 15 Val Ser Pro Ser Arg His GlnLeu Val Thr Val Ser Asn Ser Ser Ser 20 25 30 Ser Pro Ile Gly Thr Thr ValAla Phe Gly Thr Pro Ser Pro Ile Ile 35 40 45 Pro Ile Ser Thr Ala Pro SerThr Asn Gly Thr Ala Thr Phe Gly Val 50 55 60 Pro Pro Pro Ile Ser Pro ProIle Asn Ser Ser Ser Ser Leu Pro Ser 65 70 75 80 Thr Gly Pro Leu Glu AlaSer Val Gln Leu Lys Ile Gly Phe Leu Phe 85 90 95 Ala Asn Gly Thr Gln ArgLeu Arg Met Leu Phe Gly Phe Gly Gln Ser 100 105 110 Ala Pro Ala Val ThrLeu Ala Leu Glu Arg Ala Arg Gln Glu His Leu 115 120 125 Ile Asp Ser IleAsn Phe Thr Tyr Thr Trp Arg Met Cys Gly Cys Phe 130 135 140 Gln Pro TrpAla Val Gly Tyr Ala Thr Gln Leu Val Leu Thr Glu Asn 145 150 155 160 ValAsp Ala Leu Ile Gly Pro Pro Cys Ala Ile Ala Ala Gly Tyr Val 165 170 175Ala Ser Phe Tyr Asn Ile Pro Leu Tyr Leu Trp Gly Ala Thr Val Ala 180 185190 Ser Glu Phe Tyr Asn Thr Thr Val Tyr Pro Thr Leu Asn Asn Val Asn 195200 205 Val Asn Ser Asp Met Leu Ala Leu Ala Leu Gln Ser Val Leu Val Gln210 215 220 Phe Asn Trp Thr Glu Val Ser Phe Val Tyr Thr Pro Asp Asn GluArg 225 230 235 240 Met Val Cys Asn Ser Val Lys Gln Ser Leu Thr Asn ValLeu Asn Val 245 250 255 Thr Asn Val Thr Ile Val Phe Gln His Gln Met GluSer Asn Val Asp 260 265 270 Ser Met Lys Ala Thr Leu Arg Asn Leu Arg SerArg Ser Arg Ile Val 275 280 285 Leu Ser Cys Phe Asp Val Glu Val Asp ArgArg Asn Phe Leu Leu Ser 290 295 300 Ile Phe Asp Thr Gly Leu Ala Ala AspAsn Glu Phe Val Phe Ile Met 305 310 315 320 Gly Ser Leu Arg Asn Gln GlyMet Leu Gln Gln Val Ala Ser Arg Glu 325 330 335 Asp Gly Ser Val Lys TyrVal Asn Asn Trp Met Asp Lys Asn Ser Pro 340 345 350 Gly Asp Gly Arg AspSer Asp Ala Leu Ala Ala Thr Lys His Val Ile 355 360 365 Met Ile Asp LeuGlu Asn Gln Ser Ser Asp His Leu Asn Glu Phe Asn 370 375 380 Arg Asp LeuSer Ala Lys Phe Gly Thr Tyr Pro Phe Phe Cys Asn Gly 385 390 395 400 SerCys Met Gly Gly Ala Ala Glu Gln Ser Pro Ser Gln Tyr Ala Arg 405 410 415Ala Leu Phe Asp Thr Thr Tyr Ala Tyr Phe Arg Ala Leu Asn Arg Thr 420 425430 Met Glu Lys Arg Lys Ser Asn Gly Arg Asp Leu Leu Arg Asn Gly Thr 435440 445 Glu Leu Asn Ala Glu Thr Ala Gly Thr Thr Phe Gln Gly Glu Thr Gly450 455 460 Arg Ile Thr Phe Asp Ala His Gly Asn Arg Gln Pro Thr Phe PheVal 465 470 475 480 Thr Met Leu Asn Ala Leu Asn Val Pro Thr Val Met ValLys Val Asn 485 490 495 Ile Thr Asn Gly Val Leu Lys Met Glu Arg Leu TyrGly Ser Glu Ala 500 505 510 Ser Leu Trp Val Asn Trp Gly Gly Phe Arg ProMet Thr Thr Pro Leu 515 520 525 Cys Gly Tyr Asn Gly Thr Met Cys Gly GlnAsn Val Thr Val Tyr Ile 530 535 540 Leu Ile Gly Val Thr Leu Met Leu LeuLeu Leu Val Ala Ala Leu Leu 545 550 555 560 Gly Ile Gly Tyr Ala Ile ArgGlu Lys Met Arg Glu Lys Gln Arg Leu 565 570 575 Thr Arg Glu Cys Leu IlePro Phe Ala Glu Leu Arg Asn Leu Lys Glu 580 585 590 Leu Arg Ser Ser GluGlu Leu Lys Ser Glu Thr Glu Lys Ser Met Arg 595 600 605 Ser Met Arg SerSer Gln Ser Gly Ser Thr Arg Leu Thr Val Gly Ser 610 615 620 His Lys AlaGln Arg Glu Thr Ala Asn Cys Ala Phe Phe Val Phe Asn 625 630 635 640 ArgGlu Ile Val Leu Ala Val Lys Tyr His Val Arg Val Arg Ile Met 645 650 655Ser Glu Asp Leu Ala Phe Ile Arg Lys Leu Arg Gln Leu Asp His Asp 660 665670 Asn Met Asn Lys Leu Tyr Gly Val Cys Thr Asp Gly Pro Leu Leu Phe 675680 685 Ala Ile Trp Arg Asn Cys Gln Arg Gly Thr Leu Lys Glu Leu Ile Ala690 695 700 Lys Glu Gln Tyr Val Gly Asp Asn Cys Val Met Phe Ala Leu MetArg 705 710 715 720 Asp Ile Ala Asn Gly Leu Leu Ala Ile His Gln Ser PheIle Gly Ala 725 730 735 His Gly Leu Leu Ser Ser Glu Asn Cys Leu Ile AsnAsp Arg Trp Gln 740 745 750 Val Lys Ile Ser Asp Phe Gly Leu Asn Met IleArg Glu Ser Gln Thr 755 760 765 Leu Ser Lys Lys Ala Leu Leu Trp Thr AlaPro Glu Leu Leu Arg Glu 770 775 780 Asn Asn Arg Lys Gly Ala Lys Glu GlyAsp Val Phe Ser Phe Ala Ile 785 790 795 800 Ile Cys Val Glu Met Met AsnArg Glu Thr Val Trp Asn Gly Val Glu 805 810 815 Arg Asp Gln Asp Ile AspGlu Ile Leu Tyr Arg Leu Arg Arg Thr Asn 820 825 830 Thr Thr Ile Pro HisArg Pro Gln Leu His Pro Arg Ala Glu Ile Asn 835 840 845 Gln Ser Leu LeuHis Leu Ile Arg Asp Cys Trp Ser Glu Val Pro Ser 850 855 860 Glu Arg ProArg Met Asp Ile Val Arg Thr Met Leu Lys Gln Met Val 865 870 875 880 GlnAsp Gly Ser Gln Asn Leu Met Asp Tyr Val Phe Gly Met Leu Glu 885 890 895Gln Tyr Ala Ser Ser Leu Glu Gln Glu Val Glu Glu Arg Thr Lys Glu 900 905910 Leu Val Glu Glu Lys Arg Lys Ser Asp Ile Leu Leu Tyr Arg Met Leu 915920 925 Pro Arg Gln Val Ala Asp Lys Leu Lys Ile Gly Glu Ser Val Glu Pro930 935 940 Glu Ser Phe Gln Met Ala Thr Ile Phe Phe Ser Asp Val Val SerPhe 945 950 955 960 Thr Thr Leu Ala Gly Lys Cys Ser Pro Leu Gln Val ValAsn Leu Leu 965 970 975 Asn Gly Leu Phe Thr Ala Phe Asp Gly Ile Ile AspThr His Asp Cys 980 985 990 Tyr Lys Val Glu Thr Ile Gly Asp Gly Tyr LeuVal Cys Ser Gly Ile 995 1000 1005 Pro Lys Arg Asn Gly Asp Gln His AlaLys Glu Ile Ala Glu Leu 1010 1015 1020 Ser Phe Ala Phe Leu Arg Thr ValSer Ser Phe Arg Val Asp His 1025 1030 1035 Leu Pro Ser Glu Arg Val AsnLeu Arg Ile Gly Phe His Ser Gly 1040 1045 1050 Pro Ala Val Ala Gly ValVal Gly Leu Thr Met Pro Arg Tyr Cys 1055 1060 1065 Leu Phe Gly Asp SerVal Asn Thr Ala Ser Arg Met Glu Ser Asn 1070 1075 1080 Gly Lys Ala GlyArg Val His Ile Ser Ser Ser Ala Asn His Phe 1085 1090 1095 Leu Thr SerVal Ile Gly Gly Tyr Val Thr Glu Pro Arg Gly Glu 1100 1105 1110 Val IleIle Lys Gly Lys Gly Val Met Glu Thr Phe Trp Leu Leu 1115 1120 1125 GlyArg Ile Gly Glu Ala His Leu Ser Glu Gly Thr Ala Glu Arg 1130 1135 1140Asn Ala Ser Ala Ala Thr Arg Lys 1145 1150 3 7127 DNA Heterodera glycines3 gaatggcgaa cagtcatttg ccgatattta tatgccaatc tccacctcta agccgaccga 60actcatccgc cggttcttgg ctcactttgc cctttcattt ggctatattt catttccata 120gctttaaatt tattcagacg tcaattgttg aattcgtttc gccattagct tttgtttatg 180acttttccac atctgttcaa ataatttgaa caccaaaaac ttggcttttt tgattgctta 240accattcatt ttgttaaatt tgccgttcat tttgtttgat cacctaaaaa atttggcgta 300gtttttgtga tcacattagg agttgggacc aagttagcca atttttaggg gtgtattttt 360cgttttttta aatcccgagc taggtggtgg attttccaaa atttttagcc gttttcattt 420tttacttatt taatgggaaa atgtaaaacg gaaaaaatta aacgcatgga actgaaaaaa 480ggaaatgaaa gatatttcaa acggaagagg aaaagaaaac tatcaaaaat taaagaaaat 540tcttaagaaa aagcaccgat tatttatata tatattttta ttttttgctg aatatatttt 600ttcattttta cttgtattat tataattttt tgtttattga taaggatttt aaaaatttcg 660atatttctta gaattaaaat ttgtcactct gtcaggactt tgaactttaa aaaatttgaa 720ctttggcggt aaaattattt aacgatgccc tcccaaacaa agctgaagag gaagtggatg 780gaatcacttc aaaagaacga attcattttg gaattttgaa atgcccttta agggattatg 840cgactgccgc aaaacgaccg taccgaatca gcttgaaaca ttttgtaagg tttctccgct 900gggatttgaa tcccgacaaa tcgccttttt aatgaattca tttcatttta aaatttcctt 960gcccaaaatc tctcaaaatg gaaatgccgt cctgtttctt cctccttttc tttcttatgc 1020tttttgtcag cccttctcgg caccaattag tcactgttag caactcatcg tcttcgccca 1080ttggcaccac cgtcgctttt ggcactcctt cgccgatcat cccaatctct actgcaccct 1140ccacgaatgg cactgccact tttggcgtcc ctccgccgat cagtccgccg attaattctt 1200cgtcctctct cccatcaact ggtcctttgg aagcatcggt tcagttaaaa atcggcttcc 1260tctttgctaa cggcacccaa cggttgcgaa tgcttttcgg ctttggccaa tccgcgcccg 1320ccgtcacttt ggcactcgaa cgggcgaggc aggagcacct catcgacagc atcaacttca 1380cgtgggcaat tggaatgaat ttagaaactc acaattttca aatcactttt tgcaaattta 1440aaaatctcca agcgagcgga aataattggc cgtaaatgcc aatttcagtt acacgtggcg 1500aatgtgcggc tgctttcagc cttgggctgt cggctacgcc actcaactgg ttctgacgga 1560aaatgtggac gctttgatcg gtccgccttg tgtgaccagt aatgcctttt tgtttgacac 1620tttgccaaaa attgcgtaaa tgaaaaaggt gccatcgccg cgggatacgt ggcctccttc 1680tacaacattc cactgtattt gtggggtgct actgtggcct cggaatttta caacactacc 1740gtatacccta cactgaacaa cgtgaacgtt aactcggaca tgtcagtgga gaaaatattc 1800ttcgccttcc atcccctaaa attattattt ctactaataa aataaactaa aatggatttg 1860ctttacgagc cgtcaccaaa tcaaatgacc aatgatttct tattaatttc aacaattatt 1920gcaggttggc gttggcctta caaagtgtgt tggtgcaatt caattggaca gaagtgtcct 1980tcgtgtacac tccggacaat gagcgaatgt aagaattatt ttgaataatt aattaattaa 2040ttagctatta atataattta attagggtct gtaactcggt gaaacagagt ctcacaaatg 2100tgctcaacgt gaccaatgtg accattgttt tccagcatca gatggagtcc aatgtggaca 2160gtatgaaggc gacgctgaga aatctgcgca accgatcgcg aagtgatggg ataattaatt 2220tgatagcatc gctaattacc atcaattagt tgtgctttcc tgtttcgatg tcgaagttga 2280acgtcgcaac tttctgttgt ccattttcga cactggtctt gctgcggaca acgaatttgt 2340gttcatcatg ggatccctgc gcaaccaggg catgctccag cagggtaatt aggcaaatgg 2400ccaaattagg ggagggataa ttaaaggggc aattgattag acctaacagt gcttcagttg 2460cgtcgcgtga ggacggcagt gtcaaatatg tgaacaattg gatggacaaa aacagcccag 2520gcgatggccg cgactcggac gcactcgccg cgacaaaaca cgtcataatg gtcaatccac 2580ggcgaagcca agcaaattta agtgccttag tgtcgttcag attgacctgg aaaaccaatc 2640gagtgatcag cttaacgaat tcaaccgaaa tttgagtgcg aaattcggca cttatccctt 2700tttctgcaac ggaagttgca tgggcggcgc aacagaacaa tcggttgcaa tttgcacgaa 2760aaatggaaat tataaaaaaa gataatgctt tttagccgtc gcaatacgcc agggctttgt 2820tcgacacaac atacgcatat tttagagcat tgaatcgcac aatggaaaag cgcaaatcga 2880atgggaggga ttgttgcgca acggcacgga attgaacgca gaaactgccg ggacgacctt 2940tcagggtgag gtggagagga agaaaaggag gggggggggt agaagcgaat ttgggagagg 3000aaccaatgga taaagcctgg caaaatgatg gagcattaag gcgagaccgg acgcatcact 3060tttgacgccc acggcaaccg ccagccgacc ttttttgtga cgatgctaaa cgcactgaat 3120gtgcccactg ttatggtgaa agtgaacatt accaacggag tattggtgcg aatggattaa 3180gcggcggcgg attcttattt ttgaaattca acagaaaatg gaacggctgt acggcagtga 3240ggcgtcgctg tgggtcaatt ggggcggctt tcggccgatg accacgccgt tgtgcggcta 3300caacggcaca atgtgtggcc aaaatgtgac ggtgtacatt ctgatcggcg ttacgcttat 3360gttgctgttg ctggtcgccg ctttgcttgg catcggatac gcaattcggt aaacagagga 3420aaagcatcgg actgtgccga tcaaaaattg atttaaaagc aaagccaatg gtcgataatt 3480ggaccaaaag gaatgttaac gccaaaatca ctcattaatt ataaatttta agcttttaat 3540gcattaattg gaccgaattt ctccgaattg gcacaactta agcccattaa tccgccgatt 3600ttgccatttg cagggagaaa atgcgcgaga agcagcgcct gacacgcgag tgtttgatcc 3660catttgcaga gctgcgcaac ctgaaagagc tgcgcagttc ggaggaactg aagtcggaga 3720cggagaagag catgcggagc atgcgtagca gtcagtcggg tgagcgtacg gtagcgacca 3780ttcgctcaaa tatggcatat ggtaacgcag gaagcacacg gcttacggtc ggcagccaca 3840aagcgcagcg ggagacggcc aattgcgcgt tctttgtgtt caaccgggaa attgtgctgg 3900ccgtgaaata ccacgtcagg gtgcgaatta tgtccgaaga tttggccttc attaggaagg 3960taaatgggac aacggcagga gaccatgttg gaaaatgtga ctaattacct aaagatgggt 4020caatcgttac catataatat atacttccgt cgttacggta cagctgcggc agttggacca 4080cgacaacatg aacaagttgt acggcgtgtg caccgatggg ccccttttgt tcgcaatttg 4140gcgcaattgt cagcgaggga cattaaaagt gcaatttgac agaaaataac atttgttcag 4200taaaataaca tgtcatgtca ttaaggaact gatcgccaag gagcaatacg ttggggacaa 4260ttgtgtgatg tttgctctga tgcgggacat tgcaaatgta aacactcaca tgggaagtgc 4320tttggcattg gcacttccca tcagctgttt ctttcctaaa tcggggccat tttcggggat 4380ctccgacctc gacattcttc tgtcataatt ggcggcttgc cgtttccaat ttcttctgca 4440taaattcgaa tttcggtcac tccatcccca gggtctgctc gccatccatc aatcgttcat 4500cggagcccac gggctgctct cctctgaaaa ttgtctgatc aatgaccggt ggcaagtgaa 4560aatcagcgac tttggcctga atatgatcag agaaagtcaa acgctgtcga agaaaggttg 4620gcattgggtt tacgaagtaa ttcgcgtaat ttgcgccatt taaagcactt ttgtggacgg 4680cgcctgaact tttgcgagaa aacaatcgga agggaacaaa agagggcgat gtgttcagtt 4740ttgcgatcat ttgtgtggaa atggtgaaca gagagacggt gtggaacgga gtggaaaggg 4800accaagacat cgatggtggg agggcggagg aggggatttg gaggggaaat ttcggcactc 4860ggcttttcct tttccatcgg tgccattgtc cgacctttct ttagaaatcc tttatcggct 4920cagacgcacc aacaccacaa tccctcaccg tccgcagctt catccccgcg cagagattaa 4980ccaaagtttg gtatgcgcac gcatttctca tggcattttg ctgctgctat tctgtcttat 5040accatatccc cacttaccgt aagcttcatc tgatcagaga ctgttggtcc gaagtgccgt 5100ccgaacgtcc gcgcatggac attgtgcgaa cgatgctcaa acagatggtc caggacgggt 5160cagtaagtca acagcggagc aatcaatgga cacgcttgtg atgctcgaaa gtctcgagag 5220agcagtctcg gggtttttta atgcccttga ccgggtcaaa gcttgagtat cggcgatctt 5280aagtagaaca agcgcttttc cgatccgctt ttgccccccc ccccaatttt tgcccatttc 5340ctttctcttc agcagtcaaa atctgatgga ttacgtgttc ggcatgttgg agcagtacgc 5400gagttcgctg gagcaggagg tggaggaacg gaccaaagag ttggtggagg agaagcgcaa 5460gagcgacatt cttctctacc ggatgttgcc gcggcaggtg gcggacaaac tgaagatagg 5520cgagtctgtg gagccagaat ccttccaaat ggccaccatt ttcttctccg acgtcgtctc 5580cttcaccact ttggccggca aatgctcgcc attgcaagtg ccgtaaaaaa agaaaattta 5640ccgctacact tttggaaaaa taaattgtcg catatttttc agaccccaat taatcaatta 5700atttaaatca aacaagattg atcaaaatgg gaaatactga tcaattacat tgatcaaaat 5760ggggaggaat cgactgatca atcccatccg tccccacccc tctcttctcg tttaggttgt 5820gaatctgctc aacggtctgt acacagcctt tgacgggatc attgacactc atgactgcta 5880caaagttggt aagtgaccag cgaatacctc actaatcgtc ttcaactctc tctcctccta 5940ttttattgct ttgtattagt tctaatttgc cattttaatt gccccccgcc acttctcccc 6000tcagaaacca ttggcgatgg ctatttggtc tgttcgggca ttccgaagcg caacggcgac 6060caacacgcga aagaaatagc cgaactttcg ttcgccttcc ttcgcactgt gtccagcttc 6120cgtgtcgatc acctcccctc cgaacgggtc aaccttcgca ttggcttcca ttccggttcg 6180ttttcgctat taccgaatca aaaagactcc caacggcacc ccggggcatt ccctggcttc 6240ttcccaattt ggcatttctt tacgaatgcc atggttaatt aattaattag gaccagcggt 6300cgctggcgtc gtcggactga caatgccgcg ctattgtctc tttggggact cagtgaacac 6360ggccagccga atggagtcaa acggaaaggg taaataaacg ggagaaaaag cgaaacaaaa 6420caaatcaaat taatttggca accattttca gcaggccgag tgcacatttc atcaagtgcc 6480aaccactttt tgaccagtgt aatcggcgga tatgtgacag agccaagagg cgaagtgatt 6540ataaaggtca ttaattaagg atgggggcaa tggctccaat tagtcggtta atcccattat 6600tagggcaaag gagtgatgga gaccttttgg ctgttagggc gaattggaga ggcacatttg 6660tcggagggca cagcggaaag aaatgcgagt gccgcaacga gaaaatgaag aaacatcaca 6720cggcattccc tctgatcact cattttaatg actcgaaatc attgaccaat tttaatgaat 6780tttaatctct tttattatta tgatagcgca atttttgcgc acatttaagc gataacaatt 6840tttatattaa agttcccctt aacaaattta ctattgtaaa tactgtctcg aatacaaaaa 6900atgtataatt tactattttt ctcacgatat tcatggcaaa aaggtcatcc ctaattatta 6960aacgttactc tttcatgtgt tcattaacac acaataattt tttgtctcag atttactaat 7020tacatataca ataagaacaa aaatattttt tggaaaaagt ttacaatata aagataatat 7080taaaggagca attagtgaaa atgcatataa ttagaaatga tcgagtc 7127 4 3499 DNAHeterodera glycines CDS (3)..(3347) 4 at ttc gtt ccg atg ttt ttt ggg acatcg gtt gct gtt gtt ctt tgt 47 Phe Val Pro Met Phe Phe Gly Thr Ser ValAla Val Val Leu Cys 1 5 10 15 tgg ctt ttt tgc act ttc cca acg aca ttcggc caa cag caa aat ggg 95 Trp Leu Phe Cys Thr Phe Pro Thr Thr Phe GlyGln Gln Gln Asn Gly 20 25 30 act gcg ccg ctg atc aaa gtc ggg cta atg atgccg cac aat cag tcg 143 Thr Ala Pro Leu Ile Lys Val Gly Leu Met Met ProHis Asn Gln Ser 35 40 45 tcc gat ttg tct ttt gcc cga tcc gcc ggt gcc atctca gtg gcg ctg 191 Ser Asp Leu Ser Phe Ala Arg Ser Ala Gly Ala Ile SerVal Ala Leu 50 55 60 aag cac att ttc aac gac aat ttg ttg cct ccc ggc accaat ttc agt 239 Lys His Ile Phe Asn Asp Asn Leu Leu Pro Pro Gly Thr AsnPhe Ser 65 70 75 ttc att gtc cgt ttc gaa gag tgc cta atg tcc gtc gcc gccggg tac 287 Phe Ile Val Arg Phe Glu Glu Cys Leu Met Ser Val Ala Ala GlyTyr 80 85 90 95 gcc ttc gat ttg ttg gat ggc cag caa gtg gac ctt ttc attgcg ccg 335 Ala Phe Asp Leu Leu Asp Gly Gln Gln Val Asp Leu Phe Ile AlaPro 100 105 110 ccg tgc acc gac agt gcg caa gtt gca ctt ttc gtg tcc acattt tac 383 Pro Cys Thr Asp Ser Ala Gln Val Ala Leu Phe Val Ser Thr PheTyr 115 120 125 aac atc cct tcc atc aca tgg ggc cag aat tcg gac tcc tctttc aat 431 Asn Ile Pro Ser Ile Thr Trp Gly Gln Asn Ser Asp Ser Ser PheAsn 130 135 140 tcg cag agc aat tac ccc act ttg ctg agt gcg ctt ccc aattac gcc 479 Ser Gln Ser Asn Tyr Pro Thr Leu Leu Ser Ala Leu Pro Asn TyrAla 145 150 155 gac ttt ggc caa att atc att tcg ctg tgc atc ttc ttc aagtgg tcc 527 Asp Phe Gly Gln Ile Ile Ile Ser Leu Cys Ile Phe Phe Lys TrpSer 160 165 170 175 gtc atg gca ctg att tat cag ctc agc gag acg ggt caatgc gcg tcg 575 Val Met Ala Leu Ile Tyr Gln Leu Ser Glu Thr Gly Gln CysAla Ser 180 185 190 ttc cag caa gac ttg cag atc gcg atc aat tcc aac gacaaa tgc gat 623 Phe Gln Gln Asp Leu Gln Ile Ala Ile Asn Ser Asn Asp LysCys Asp 195 200 205 atc agc tac aga gag gaa gtt aag atc agt tct gcg ggcacc agc gac 671 Ile Ser Tyr Arg Glu Glu Val Lys Ile Ser Ser Ala Gly ThrSer Asp 210 215 220 gcc caa tac acc ata agt caa att cag agc agg gcg agaatc gtc att 719 Ala Gln Tyr Thr Ile Ser Gln Ile Gln Ser Arg Ala Arg IleVal Ile 225 230 235 ctt tgc ttc gac gag ttt gtt cag ctg cgc aac ttt gccgcc aaa ctt 767 Leu Cys Phe Asp Glu Phe Val Gln Leu Arg Asn Phe Ala AlaLys Leu 240 245 250 255 cag gag ggt ggc ttg gac tcc gct gac tac gtt tatctc atc ccc gga 815 Gln Glu Gly Gly Leu Asp Ser Ala Asp Tyr Val Tyr LeuIle Pro Gly 260 265 270 ctc acc atg gat gat agt att gaa agt gtt aat tgtgtc ttt tat aaa 863 Leu Thr Met Asp Asp Ser Ile Glu Ser Val Asn Cys ValPhe Tyr Lys 275 280 285 att caa att tgc gtt tgc ttt ttc tct gtt ttt aattta ctt ttt gtt 911 Ile Gln Ile Cys Val Cys Phe Phe Ser Val Phe Asn LeuLeu Phe Val 290 295 300 ttg ggt ggc tcc aag gcc acc gcg tgg tgg gtc gacccg aac ccg acc 959 Leu Gly Gly Ser Lys Ala Thr Ala Trp Trp Val Asp ProAsn Pro Thr 305 310 315 atc caa tca gcg gcc tac aga att gct cag cgc agtctt tat ctg atg 1007 Ile Gln Ser Ala Ala Tyr Arg Ile Ala Gln Arg Ser LeuTyr Leu Met 320 325 330 335 ttg gac atc ttc aac aaa gtc gca act tca ggtcaa gtg ggc aac ggc 1055 Leu Asp Ile Phe Asn Lys Val Ala Thr Ser Gly GlnVal Gly Asn Gly 340 345 350 act tcg ttt gat cag gaa gtg atc aga cag gtcacc caa tgg ccc ttc 1103 Thr Ser Phe Asp Gln Glu Val Ile Arg Gln Val ThrGln Trp Pro Phe 355 360 365 ttc tgt acc gat tgc gat cag tcg ttg cag gcttct tct tac gcc cct 1151 Phe Cys Thr Asp Cys Asp Gln Ser Leu Gln Ala SerSer Tyr Ala Pro 370 375 380 ttg ctc cac gac agt ttc tat ttg tat gcc atggcc ctt tcc aaa gcg 1199 Leu Leu His Asp Ser Phe Tyr Leu Tyr Ala Met AlaLeu Ser Lys Ala 385 390 395 gca aaa att gcc ggc gca ttg tca cct tcc gtttac cga aat ggc caa 1247 Ala Lys Ile Ala Gly Ala Leu Ser Pro Ser Val TyrArg Asn Gly Gln 400 405 410 415 ttg att cgc tcc caa acc gcc aat ttg tctttt gaa gga atg acg ggg 1295 Leu Ile Arg Ser Gln Thr Ala Asn Leu Ser PheGlu Gly Met Thr Gly 420 425 430 tca aac aaa ttt gga tct gat gga ctt cgtaat ttc att tac ctt gtc 1343 Ser Asn Lys Phe Gly Ser Asp Gly Leu Arg AsnPhe Ile Tyr Leu Val 435 440 445 tcc atg tat tcg agc ttg aac ggt gac ttgact tcg tat gtg tgg ctc 1391 Ser Met Tyr Ser Ser Leu Asn Gly Asp Leu ThrSer Tyr Val Trp Leu 450 455 460 caa atg aac gat gcc gga gtg aat tct tcatgg att aat gcc acg gcc 1439 Gln Met Asn Asp Ala Gly Val Asn Ser Ser TrpIle Asn Ala Thr Ala 465 470 475 gag aag ctg att tgg tcg agc cga aac ggcgtt aag cca ttg gcc gtg 1487 Glu Lys Leu Ile Trp Ser Ser Arg Asn Gly ValLys Pro Leu Ala Val 480 485 490 495 ccg ttg tgc gga ttt gac ggc aac ggctgt cac atg gac ttc ttc acg 1535 Pro Leu Cys Gly Phe Asp Gly Asn Gly CysHis Met Asp Phe Phe Thr 500 505 510 gag tac cgt ggg tat gtg ata gct gccggc tgt ctg ttg ctg ctc att 1583 Glu Tyr Arg Gly Tyr Val Ile Ala Ala GlyCys Leu Leu Leu Leu Ile 515 520 525 ttg ggc tcg ttc gcc ttc ggc att tactgg ctg ttc caa tcc aag gcg 1631 Leu Gly Ser Phe Ala Phe Gly Ile Tyr TrpLeu Phe Gln Ser Lys Ala 530 535 540 cgc gag atg gaa cgg caa aat cgc ctctgg caa atc gcc tac agt act 1679 Arg Glu Met Glu Arg Gln Asn Arg Leu TrpGln Ile Ala Tyr Ser Thr 545 550 555 ctg acg ccg gcg ggc acc aaa aag aaaatg atg gaa agt gtg cgc tct 1727 Leu Thr Pro Ala Gly Thr Lys Lys Lys MetMet Glu Ser Val Arg Ser 560 565 570 575 ctc cag tcg agc act tct tct cagttc acg cgc gac tcc tcc cat tcc 1775 Leu Gln Ser Ser Thr Ser Ser Gln PheThr Arg Asp Ser Ser His Ser 580 585 590 cac gtt tcc atc aaa cac aac ttcaat ggc atc gtg tac att atg aac 1823 His Val Ser Ile Lys His Asn Phe AsnGly Ile Val Tyr Ile Met Asn 595 600 605 ggc gag cgg gtg atc ggc att cagcat tcg gtt ggc att cga ctc agt 1871 Gly Glu Arg Val Ile Gly Ile Gln HisSer Val Gly Ile Arg Leu Ser 610 615 620 cca cag gac atg gcc gag ctg agaact atg cgc ctt ttg gat gga gac 1919 Pro Gln Asp Met Ala Glu Leu Arg ThrMet Arg Leu Leu Asp Gly Asp 625 630 635 aat gtg aac cga ttc atc ggc ctttcc atc gat ggc gcc gcg ctt ctc 1967 Asn Val Asn Arg Phe Ile Gly Leu SerIle Asp Gly Ala Ala Leu Leu 640 645 650 655 tcc ctg tgg cgc tac tgc tcgcgt ggc ccc ctt tcg gac gtg atc tcg 2015 Ser Leu Trp Arg Tyr Cys Ser ArgGly Pro Leu Ser Asp Val Ile Ser 660 665 670 ggc tct tcc tct ctg acc atggac ggc ttc ttc att tat tcg ttg gtc 2063 Gly Ser Ser Ser Leu Thr Met AspGly Phe Phe Ile Tyr Ser Leu Val 675 680 685 cgc gac gtt gcc gaa gga ttgcgc ttc ctt cac gcg tcc tca att gga 2111 Arg Asp Val Ala Glu Gly Leu ArgPhe Leu His Ala Ser Ser Ile Gly 690 695 700 tgg tat ggc aat ttg cgt tccacc aac tgt ttg atc gac gac cgt tgg 2159 Trp Tyr Gly Asn Leu Arg Ser ThrAsn Cys Leu Ile Asp Asp Arg Trp 705 710 715 caa ata aaa ctg tcc gag tttggt ctc cgc ttc ttt cgt gca cac gaa 2207 Gln Ile Lys Leu Ser Glu Phe GlyLeu Arg Phe Phe Arg Ala His Glu 720 725 730 735 aaa cgg gag gca aaa gatttg gtt tgg aca gcg cca gaa ttg ttg cgc 2255 Lys Arg Glu Ala Lys Asp LeuVal Trp Thr Ala Pro Glu Leu Leu Arg 740 745 750 gat aat gac atc gtt ggcaac aaa ttt ggc gat gtt tac agc ttt tcc 2303 Asp Asn Asp Ile Val Gly AsnLys Phe Gly Asp Val Tyr Ser Phe Ser 755 760 765 atc gtt tct tcc gaa attgtg aat atg aag cca att tgg gag cag gac 2351 Ile Val Ser Ser Glu Ile ValAsn Met Lys Pro Ile Trp Glu Gln Asp 770 775 780 gaa gcg aag gga aat gttgaa agg gtc cga acc ggg ggg aag agg gca 2399 Glu Ala Lys Gly Asn Val GluArg Val Arg Thr Gly Gly Lys Arg Ala 785 790 795 ttt cgt ccc aaa ttg gagccg agc agc cag gac ttg tcc ccg gca ctg 2447 Phe Arg Pro Lys Leu Glu ProSer Ser Gln Asp Leu Ser Pro Ala Leu 800 805 810 815 ctg cat ctg atc aaagac tgc tgg gac gaa agc cct gca gaa cgg cca 2495 Leu His Leu Ile Lys AspCys Trp Asp Glu Ser Pro Ala Glu Arg Pro 820 825 830 aaa atg gag acg gtgacc gca ctt ttg cag tca atg aac acg gga agg 2543 Lys Met Glu Thr Val ThrAla Leu Leu Gln Ser Met Asn Thr Gly Arg 835 840 845 agc acc aat ttg atggac cac gtg ttc aat atg ctg gaa gtg tac gcc 2591 Ser Thr Asn Leu Met AspHis Val Phe Asn Met Leu Glu Val Tyr Ala 850 855 860 ggc tca ttg gag gaggaa gtt gag gaa cgg acc aaa gag ttg gtg gag 2639 Gly Ser Leu Glu Glu GluVal Glu Glu Arg Thr Lys Glu Leu Val Glu 865 870 875 gag aag aag aag acggac atc ctt ctc tac cga atg ctg ccc aaa caa 2687 Glu Lys Lys Lys Thr AspIle Leu Leu Tyr Arg Met Leu Pro Lys Gln 880 885 890 895 gtc gcc gac aaactc aaa ttg ggc caa tct gtg gag ccc gaa acc ttc 2735 Val Ala Asp Lys LeuLys Leu Gly Gln Ser Val Glu Pro Glu Thr Phe 900 905 910 gac tgc gtt accgta ttc ttc tcg gac gtc gtc tca ttc aca aca atc 2783 Asp Cys Val Thr ValPhe Phe Ser Asp Val Val Ser Phe Thr Thr Ile 915 920 925 gct tca aaa tgctca cct ttg cag gtg gtc aat ttg ctg aac aat ctg 2831 Ala Ser Lys Cys SerPro Leu Gln Val Val Asn Leu Leu Asn Asn Leu 930 935 940 tac act ctg ttggac tca atc atc gcc gaa ttt gac gtg tac aaa gtt 2879 Tyr Thr Leu Leu AspSer Ile Ile Ala Glu Phe Asp Val Tyr Lys Val 945 950 955 gag aca att ggcgat ggt tat ttg tgc gtg tcg ggc ctt ccc cac cgc 2927 Glu Thr Ile Gly AspGly Tyr Leu Cys Val Ser Gly Leu Pro His Arg 960 965 970 975 aat ggg catgaa cac gcg caa cac atc gcc aaa atg tcg ttg gca ttc 2975 Asn Gly His GluHis Ala Gln His Ile Ala Lys Met Ser Leu Ala Phe 980 985 990 atg cgc aacttg ggc agc ttc acc att ccc cac ttg ccc att gaa cgg 3023 Met Arg Asn LeuGly Ser Phe Thr Ile Pro His Leu Pro Ile Glu Arg 995 1000 1005 ctt cgtctc cgc att ggc att cac acc ggc tcc acc gtg gcg ggc 3068 Leu Arg Leu ArgIle Gly Ile His Thr Gly Ser Thr Val Ala Gly 1010 1015 1020 gtt gtc ggtctt tcc atg ccc cgt tat tgt ctg ttc ggc gac aca 3113 Val Val Gly Leu SerMet Pro Arg Tyr Cys Leu Phe Gly Asp Thr 1025 1030 1035 att aac aca gcggca cgg ctg gaa agc agc tca aag ccg atg cga 3158 Ile Asn Thr Ala Ala ArgLeu Glu Ser Ser Ser Lys Pro Met Arg 1040 1045 1050 att cac att tcc acgacg acg aat cac ttt ttg gtc aat gtt ctc 3203 Ile His Ile Ser Thr Thr ThrAsn His Phe Leu Val Asn Val Leu 1055 1060 1065 gga ggt ttt gtc acc caagcg cgt gga gaa att tta gtg aag gga 3248 Gly Gly Phe Val Thr Gln Ala ArgGly Glu Ile Leu Val Lys Gly 1070 1075 1080 aag ggc gtt ctc gaa acc ttttgg ctg ctt ggc ctc gaa ggc gac 3293 Lys Gly Val Leu Glu Thr Phe Trp LeuLeu Gly Leu Glu Gly Asp 1085 1090 1095 ccg gcg gtg atg cga atg ttg cacagt tcg gac ggt aat aat gcg 3338 Pro Ala Val Met Arg Met Leu His Ser SerAsp Gly Asn Asn Ala 1100 1105 1110 act acg gaa tgaacaaaaa caaattgaggaagaaattga acacaaagga 3387 Thr Thr Glu 1115 aacagaaaaa ccaaaagaatgaatgaatga atgatttgtc atttgtaaaa attaaaatgt 3447 cggacaacaa aaaaaatcgaaaggaacgaa aaaaaaaaaa aaaaaaaaaa aa 3499 5 1115 PRT Heterodera glycines5 Phe Val Pro Met Phe Phe Gly Thr Ser Val Ala Val Val Leu Cys Trp 1 5 1015 Leu Phe Cys Thr Phe Pro Thr Thr Phe Gly Gln Gln Gln Asn Gly Thr 20 2530 Ala Pro Leu Ile Lys Val Gly Leu Met Met Pro His Asn Gln Ser Ser 35 4045 Asp Leu Ser Phe Ala Arg Ser Ala Gly Ala Ile Ser Val Ala Leu Lys 50 5560 His Ile Phe Asn Asp Asn Leu Leu Pro Pro Gly Thr Asn Phe Ser Phe 65 7075 80 Ile Val Arg Phe Glu Glu Cys Leu Met Ser Val Ala Ala Gly Tyr Ala 8590 95 Phe Asp Leu Leu Asp Gly Gln Gln Val Asp Leu Phe Ile Ala Pro Pro100 105 110 Cys Thr Asp Ser Ala Gln Val Ala Leu Phe Val Ser Thr Phe TyrAsn 115 120 125 Ile Pro Ser Ile Thr Trp Gly Gln Asn Ser Asp Ser Ser PheAsn Ser 130 135 140 Gln Ser Asn Tyr Pro Thr Leu Leu Ser Ala Leu Pro AsnTyr Ala Asp 145 150 155 160 Phe Gly Gln Ile Ile Ile Ser Leu Cys Ile PhePhe Lys Trp Ser Val 165 170 175 Met Ala Leu Ile Tyr Gln Leu Ser Glu ThrGly Gln Cys Ala Ser Phe 180 185 190 Gln Gln Asp Leu Gln Ile Ala Ile AsnSer Asn Asp Lys Cys Asp Ile 195 200 205 Ser Tyr Arg Glu Glu Val Lys IleSer Ser Ala Gly Thr Ser Asp Ala 210 215 220 Gln Tyr Thr Ile Ser Gln IleGln Ser Arg Ala Arg Ile Val Ile Leu 225 230 235 240 Cys Phe Asp Glu PheVal Gln Leu Arg Asn Phe Ala Ala Lys Leu Gln 245 250 255 Glu Gly Gly LeuAsp Ser Ala Asp Tyr Val Tyr Leu Ile Pro Gly Leu 260 265 270 Thr Met AspAsp Ser Ile Glu Ser Val Asn Cys Val Phe Tyr Lys Ile 275 280 285 Gln IleCys Val Cys Phe Phe Ser Val Phe Asn Leu Leu Phe Val Leu 290 295 300 GlyGly Ser Lys Ala Thr Ala Trp Trp Val Asp Pro Asn Pro Thr Ile 305 310 315320 Gln Ser Ala Ala Tyr Arg Ile Ala Gln Arg Ser Leu Tyr Leu Met Leu 325330 335 Asp Ile Phe Asn Lys Val Ala Thr Ser Gly Gln Val Gly Asn Gly Thr340 345 350 Ser Phe Asp Gln Glu Val Ile Arg Gln Val Thr Gln Trp Pro PhePhe 355 360 365 Cys Thr Asp Cys Asp Gln Ser Leu Gln Ala Ser Ser Tyr AlaPro Leu 370 375 380 Leu His Asp Ser Phe Tyr Leu Tyr Ala Met Ala Leu SerLys Ala Ala 385 390 395 400 Lys Ile Ala Gly Ala Leu Ser Pro Ser Val TyrArg Asn Gly Gln Leu 405 410 415 Ile Arg Ser Gln Thr Ala Asn Leu Ser PheGlu Gly Met Thr Gly Ser 420 425 430 Asn Lys Phe Gly Ser Asp Gly Leu ArgAsn Phe Ile Tyr Leu Val Ser 435 440 445 Met Tyr Ser Ser Leu Asn Gly AspLeu Thr Ser Tyr Val Trp Leu Gln 450 455 460 Met Asn Asp Ala Gly Val AsnSer Ser Trp Ile Asn Ala Thr Ala Glu 465 470 475 480 Lys Leu Ile Trp SerSer Arg Asn Gly Val Lys Pro Leu Ala Val Pro 485 490 495 Leu Cys Gly PheAsp Gly Asn Gly Cys His Met Asp Phe Phe Thr Glu 500 505 510 Tyr Arg GlyTyr Val Ile Ala Ala Gly Cys Leu Leu Leu Leu Ile Leu 515 520 525 Gly SerPhe Ala Phe Gly Ile Tyr Trp Leu Phe Gln Ser Lys Ala Arg 530 535 540 GluMet Glu Arg Gln Asn Arg Leu Trp Gln Ile Ala Tyr Ser Thr Leu 545 550 555560 Thr Pro Ala Gly Thr Lys Lys Lys Met Met Glu Ser Val Arg Ser Leu 565570 575 Gln Ser Ser Thr Ser Ser Gln Phe Thr Arg Asp Ser Ser His Ser His580 585 590 Val Ser Ile Lys His Asn Phe Asn Gly Ile Val Tyr Ile Met AsnGly 595 600 605 Glu Arg Val Ile Gly Ile Gln His Ser Val Gly Ile Arg LeuSer Pro 610 615 620 Gln Asp Met Ala Glu Leu Arg Thr Met Arg Leu Leu AspGly Asp Asn 625 630 635 640 Val Asn Arg Phe Ile Gly Leu Ser Ile Asp GlyAla Ala Leu Leu Ser 645 650 655 Leu Trp Arg Tyr Cys Ser Arg Gly Pro LeuSer Asp Val Ile Ser Gly 660 665 670 Ser Ser Ser Leu Thr Met Asp Gly PhePhe Ile Tyr Ser Leu Val Arg 675 680 685 Asp Val Ala Glu Gly Leu Arg PheLeu His Ala Ser Ser Ile Gly Trp 690 695 700 Tyr Gly Asn Leu Arg Ser ThrAsn Cys Leu Ile Asp Asp Arg Trp Gln 705 710 715 720 Ile Lys Leu Ser GluPhe Gly Leu Arg Phe Phe Arg Ala His Glu Lys 725 730 735 Arg Glu Ala LysAsp Leu Val Trp Thr Ala Pro Glu Leu Leu Arg Asp 740 745 750 Asn Asp IleVal Gly Asn Lys Phe Gly Asp Val Tyr Ser Phe Ser Ile 755 760 765 Val SerSer Glu Ile Val Asn Met Lys Pro Ile Trp Glu Gln Asp Glu 770 775 780 AlaLys Gly Asn Val Glu Arg Val Arg Thr Gly Gly Lys Arg Ala Phe 785 790 795800 Arg Pro Lys Leu Glu Pro Ser Ser Gln Asp Leu Ser Pro Ala Leu Leu 805810 815 His Leu Ile Lys Asp Cys Trp Asp Glu Ser Pro Ala Glu Arg Pro Lys820 825 830 Met Glu Thr Val Thr Ala Leu Leu Gln Ser Met Asn Thr Gly ArgSer 835 840 845 Thr Asn Leu Met Asp His Val Phe Asn Met Leu Glu Val TyrAla Gly 850 855 860 Ser Leu Glu Glu Glu Val Glu Glu Arg Thr Lys Glu LeuVal Glu Glu 865 870 875 880 Lys Lys Lys Thr Asp Ile Leu Leu Tyr Arg MetLeu Pro Lys Gln Val 885 890 895 Ala Asp Lys Leu Lys Leu Gly Gln Ser ValGlu Pro Glu Thr Phe Asp 900 905 910 Cys Val Thr Val Phe Phe Ser Asp ValVal Ser Phe Thr Thr Ile Ala 915 920 925 Ser Lys Cys Ser Pro Leu Gln ValVal Asn Leu Leu Asn Asn Leu Tyr 930 935 940 Thr Leu Leu Asp Ser Ile IleAla Glu Phe Asp Val Tyr Lys Val Glu 945 950 955 960 Thr Ile Gly Asp GlyTyr Leu Cys Val Ser Gly Leu Pro His Arg Asn 965 970 975 Gly His Glu HisAla Gln His Ile Ala Lys Met Ser Leu Ala Phe Met 980 985 990 Arg Asn LeuGly Ser Phe Thr Ile Pro His Leu Pro Ile Glu Arg Leu 995 1000 1005 ArgLeu Arg Ile Gly Ile His Thr Gly Ser Thr Val Ala Gly Val 1010 1015 1020Val Gly Leu Ser Met Pro Arg Tyr Cys Leu Phe Gly Asp Thr Ile 1025 10301035 Asn Thr Ala Ala Arg Leu Glu Ser Ser Ser Lys Pro Met Arg Ile 10401045 1050 His Ile Ser Thr Thr Thr Asn His Phe Leu Val Asn Val Leu Gly1055 1060 1065 Gly Phe Val Thr Gln Ala Arg Gly Glu Ile Leu Val Lys GlyLys 1070 1075 1080 Gly Val Leu Glu Thr Phe Trp Leu Leu Gly Leu Glu GlyAsp Pro 1085 1090 1095 Ala Val Met Arg Met Leu His Ser Ser Asp Gly AsnAsn Ala Thr 1100 1105 1110 Thr Glu 1115 6 3008 DNA Heterodera glycinesCDS (3)..(2849) 6 cg aaa atc aca att aat tat aaa aca cgc att ttc aac attcaa tca 47 Lys Ile Thr Ile Asn Tyr Lys Thr Arg Ile Phe Asn Ile Gln Ser 15 10 15 tcc gac aca agc aca att gtc aat gca att cga gaa cgg gcg agg atc95 Ser Asp Thr Ser Thr Ile Val Asn Ala Ile Arg Glu Arg Ala Arg Ile 20 2530 gtt ttg ctt tgc ttt gac gat ttg aag cag atg cga act ttc gca ctt 143Val Leu Leu Cys Phe Asp Asp Leu Lys Gln Met Arg Thr Phe Ala Leu 35 40 45caa ttg ttc gat gga gga cta aac aca aaa gat tat gtt tac ata atg 191 GlnLeu Phe Asp Gly Gly Leu Asn Thr Lys Asp Tyr Val Tyr Ile Met 50 55 60 gtggat aat gac atg tat tta tct ttc aat ttg acg aga tta cct ttt 239 Val AspAsn Asp Met Tyr Leu Ser Phe Asn Leu Thr Arg Leu Pro Phe 65 70 75 tgg gtacaa tcg agt aac aat tca aat acg ctc gac gga aga aac gcg 287 Trp Val GlnSer Ser Asn Asn Ser Asn Thr Leu Asp Gly Arg Asn Ala 80 85 90 95 gac gccgaa gtg att ggc cga ttg gcc tta tgg tgg cac tac gac atc 335 Asp Ala GluVal Ile Gly Arg Leu Ala Leu Trp Trp His Tyr Asp Ile 100 105 110 act ttgtcc gcg ttg tcc aat caa aat tac tac ggc ttt ttc aaa aga 383 Thr Leu SerAla Leu Ser Asn Gln Asn Tyr Tyr Gly Phe Phe Lys Arg 115 120 125 gtg atcgac aga acg ggc gat tgg ccc ttt tat tgc gat gaa tcc aat 431 Val Ile AspArg Thr Gly Asp Trp Pro Phe Tyr Cys Asp Glu Ser Asn 130 135 140 tgc agcaaa gtg atc aat gca tcc atc tat tcg ctt ctg ttg tac gac 479 Cys Ser LysVal Ile Asn Ala Ser Ile Tyr Ser Leu Leu Leu Tyr Asp 145 150 155 gca atttac aat tac gga atg gca ctg aac gaa tct ttc cgc caa ttt 527 Ala Ile TyrAsn Tyr Gly Met Ala Leu Asn Glu Ser Phe Arg Gln Phe 160 165 170 175 ggcatt cgg ccc gaa gtg tac cga aac ggc act ctg ttg gca cgg aac 575 Gly IleArg Pro Glu Val Tyr Arg Asn Gly Thr Leu Leu Ala Arg Asn 180 185 190 aacaga aag cca ttc atg ggt ttg acc ggc tat gtg acg gtg gaa act 623 Asn ArgLys Pro Phe Met Gly Leu Thr Gly Tyr Val Thr Val Glu Thr 195 200 205 gatcag aac acg cgg gtg ttc gtt ttg tcc aat cgg aag tcg agc gag 671 Asp GlnAsn Thr Arg Val Phe Val Leu Ser Asn Arg Lys Ser Ser Glu 210 215 220 aaagga aat gcg cta cgc att tta atg caa ttc gca tgg gtc gag ggg 719 Lys GlyAsn Ala Leu Arg Ile Leu Met Gln Phe Ala Trp Val Glu Gly 225 230 235 aaattg caa ata tcg ttg cga aat ggc agc ttg tcc atg tgg tcc tcc 767 Lys LeuGln Ile Ser Leu Arg Asn Gly Ser Leu Ser Met Trp Ser Ser 240 245 250 255cgc ggt gga agc att cct ccg gcg gtg cca atc tgc ggc ttc gac ggc 815 ArgGly Gly Ser Ile Pro Pro Ala Val Pro Ile Cys Gly Phe Asp Gly 260 265 270aaa ggg tgt gcc gcg tcg gtg ttc gaa atg tat aaa ggc tat tta ttg 863 LysGly Cys Ala Ala Ser Val Phe Glu Met Tyr Lys Gly Tyr Leu Leu 275 280 285ctt gga att gct ctt ttt gta gtg aca ata agc ggt agc act ttt act 911 LeuGly Ile Ala Leu Phe Val Val Thr Ile Ser Gly Ser Thr Phe Thr 290 295 300gtc ggc ttt ttg ata cac gct aaa ttt gtg gaa ggt cgg aga agc aac 959 ValGly Phe Leu Ile His Ala Lys Phe Val Glu Gly Arg Arg Ser Asn 305 310 315atg agt tgg aaa ata cca ttt gct ttg ctg acg aaa tcg aaa cca aaa 1007 MetSer Trp Lys Ile Pro Phe Ala Leu Leu Thr Lys Ser Lys Pro Lys 320 325 330335 cgt gcc gac cgc aca gcc gcc aac cga agt cgg cac tcc gtc cgc tcc 1055Arg Ala Asp Arg Thr Ala Ala Asn Arg Ser Arg His Ser Val Arg Ser 340 345350 aac caa acg aac att tcc tcg ctg acc cat tcg acc att ggc agt ttg 1103Asn Gln Thr Asn Ile Ser Ser Leu Thr His Ser Thr Ile Gly Ser Leu 355 360365 gca cgg tcc cga atc ttc tcc ctg tac tca tac aat ggg gaa aag tgc 1151Ala Arg Ser Arg Ile Phe Ser Leu Tyr Ser Tyr Asn Gly Glu Lys Cys 370 375380 att gtg cgc agc ttt ggc tcc aca aca atg gca aag gca ttt aca gtg 1199Ile Val Arg Ser Phe Gly Ser Thr Thr Met Ala Lys Ala Phe Thr Val 385 390395 aca caa atg gcc gag tgc cga acg atg cgt ctg ttc gac cat gag aat 1247Thr Gln Met Ala Glu Cys Arg Thr Met Arg Leu Phe Asp His Glu Asn 400 405410 415 gtg aac cgg ttt ttg ggg ctg agt ttg gac ggg gcc aat gtg ttg gcc1295 Val Asn Arg Phe Leu Gly Leu Ser Leu Asp Gly Ala Asn Val Leu Ala 420425 430 gtg tgg aac ttt tgc atg cgc ggg tcc atc aga gac gtg att ttg tct1343 Val Trp Asn Phe Cys Met Arg Gly Ser Ile Arg Asp Val Ile Leu Ser 435440 445 gaa aat gcc atg gtc aaa gat gtg ata ttc atc cag tcg gcc atc aaa1391 Glu Asn Ala Met Val Lys Asp Val Ile Phe Ile Gln Ser Ala Ile Lys 450455 460 gag att tgt gaa ggc att cat ttc ctg cac aat tcg ccc ctc caa ttc1439 Glu Ile Cys Glu Gly Ile His Phe Leu His Asn Ser Pro Leu Gln Phe 465470 475 cat ggc cga ctg aaa tcc tcc gct tgt ttg atc aat gac cgg tgg caa1487 His Gly Arg Leu Lys Ser Ser Ala Cys Leu Ile Asn Asp Arg Trp Gln 480485 490 495 gtc aaa att tca tat ttt ggg ctt cga tgg cta aag tct tca caaaaa 1535 Val Lys Ile Ser Tyr Phe Gly Leu Arg Trp Leu Lys Ser Ser Gln Lys500 505 510 aat cgg gcg aaa gat ctt tta tgg cta tcg cct gaa caa tta cggaaa 1583 Asn Arg Ala Lys Asp Leu Leu Trp Leu Ser Pro Glu Gln Leu Arg Lys515 520 525 atg gga gac agc gaa att gtg gag ggg tca aaa cat tct gac atttac 1631 Met Gly Asp Ser Glu Ile Val Glu Gly Ser Lys His Ser Asp Ile Tyr530 535 540 acg atg gca tta atc ttc acc gaa atg gtt aat atg tct ccg tgttgg 1679 Thr Met Ala Leu Ile Phe Thr Glu Met Val Asn Met Ser Pro Cys Trp545 550 555 gac agc agc gaa gcg gac gga gca gag gct gac cgg gcc gag gatgga 1727 Asp Ser Ser Glu Ala Asp Gly Ala Glu Ala Asp Arg Ala Glu Asp Gly560 565 570 575 gaa gag caa aac gga acg gaa atg tcg cga aga aag caa acggcg gaa 1775 Glu Glu Gln Asn Gly Thr Glu Met Ser Arg Arg Lys Gln Thr AlaGlu 580 585 590 acg gag gga gaa acg gca cag cgg cgc ccg ggg cga cgc gcgagg gga 1823 Thr Glu Gly Glu Thr Ala Gln Arg Arg Pro Gly Arg Arg Ala ArgGly 595 600 605 cgc aac gcg gag gaa atc gct tat ttg gtg aag cgg ggc ggaatc gtt 1871 Arg Asn Ala Glu Glu Ile Ala Tyr Leu Val Lys Arg Gly Gly IleVal 610 615 620 ccg ctg cgg ccg atc att cgg ccg gca ttt gac cat ctg aacacg gaa 1919 Pro Leu Arg Pro Ile Ile Arg Pro Ala Phe Asp His Leu Asn ThrGlu 625 630 635 gtg att cat ctg atc cgc gac tgt tgg gtc gaa acg ccg agcgaa cgg 1967 Val Ile His Leu Ile Arg Asp Cys Trp Val Glu Thr Pro Ser GluArg 640 645 650 655 ccg acc att gaa aaa gtg cga cag aaa ttg cgg caa atgggt gcc caa 2015 Pro Thr Ile Glu Lys Val Arg Gln Lys Leu Arg Gln Met GlyAla Gln 660 665 670 cgg agg gtc aat ttg atg gac cat gtg ttc gac atg ttggag cag tac 2063 Arg Arg Val Asn Leu Met Asp His Val Phe Asp Met Leu GluGln Tyr 675 680 685 gcc aac aaa ttg gag gag gaa gtg cag gag cgg acc aaagag ttg gag 2111 Ala Asn Lys Leu Glu Glu Glu Val Gln Glu Arg Thr Lys GluLeu Glu 690 695 700 ggg gag aag cga aag tcg gac att ctt ctc tat cgg atgatg cca cgc 2159 Gly Glu Lys Arg Lys Ser Asp Ile Leu Leu Tyr Arg Met MetPro Arg 705 710 715 caa gtg gcg gac cga cta aag ctc ggc caa tcc gtg gagccc gag cag 2207 Gln Val Ala Asp Arg Leu Lys Leu Gly Gln Ser Val Glu ProGlu Gln 720 725 730 735 ttc gac tgt gtg acg gtg ttc ttc tcg gac att gtccaa ttc gcg gca 2255 Phe Asp Cys Val Thr Val Phe Phe Ser Asp Ile Val GlnPhe Ala Ala 740 745 750 ctg tcc aac caa atg cgg ccg ctg cag gtg gtc aatctg atg aac gaa 2303 Leu Ser Asn Gln Met Arg Pro Leu Gln Val Val Asn LeuMet Asn Glu 755 760 765 ctg tac acc atc ttc gac gca atc att gac gag cacgac gtg tac aag 2351 Leu Tyr Thr Ile Phe Asp Ala Ile Ile Asp Glu His AspVal Tyr Lys 770 775 780 ggc gat ggt tat ttg tgc gtg tct ggc ctt ccc aatcgg aat ggc act 2399 Gly Asp Gly Tyr Leu Cys Val Ser Gly Leu Pro Asn ArgAsn Gly Thr 785 790 795 ttg cat gcc aaa cac tgt gct gat atg gcg atc aaattt atg caa gcg 2447 Leu His Ala Lys His Cys Ala Asp Met Ala Ile Lys PheMet Gln Ala 800 805 810 815 ctg ctc aat ttc cga att ccc gac ctt cca aatgag cgc gtc cgt ctc 2495 Leu Leu Asn Phe Arg Ile Pro Asp Leu Pro Asn GluArg Val Arg Leu 820 825 830 cga att ggg ctg cac agc ggc cca tgc gtc gcggga gtc gtc ggg ttg 2543 Arg Ile Gly Leu His Ser Gly Pro Cys Val Ala GlyVal Val Gly Leu 835 840 845 gcc atg ccc cgt tac tgt ttg ttt ggg gat acggta aac acc gcc tcg 2591 Ala Met Pro Arg Tyr Cys Leu Phe Gly Asp Thr ValAsn Thr Ala Ser 850 855 860 cgc atg gaa agt tct tca agc cca aac aaa attcac atg tcc agt gaa 2639 Arg Met Glu Ser Ser Ser Ser Pro Asn Lys Ile HisMet Ser Ser Glu 865 870 875 acg ctc gaa ttg ctg cac aaa aat ttc aac ggctct tat cac acg gag 2687 Thr Leu Glu Leu Leu His Lys Asn Phe Asn Gly SerTyr His Thr Glu 880 885 890 895 agc aga ggc gaa gtg atc ata aag ggc aaaggc gtc atg gag acc ttt 2735 Ser Arg Gly Glu Val Ile Ile Lys Gly Lys GlyVal Met Glu Thr Phe 900 905 910 tgg ctg ttg ggc caa gtc gaa aat gga acaaca att aac gcc gat tat 2783 Trp Leu Leu Gly Gln Val Glu Asn Gly Thr ThrIle Asn Ala Asp Tyr 915 920 925 gcg cat aga atg cat ctg ccg gtg atc aaattt ggg gag gag gga aat 2831 Ala His Arg Met His Leu Pro Val Ile Lys PheGly Glu Glu Gly Asn 930 935 940 gaa acc gga aaa aat gcg taaagaaatggtgatgaagc caccacattt 2879 Glu Thr Gly Lys Asn Ala 945 agactgaaatgcataatgaa agagcaaaga tcaagacttt caaaatgcct gaaattaatg 2939 taactttacccaaaatttag caaaaaattt caatttatta cgaaaaaaaa aaaaaaaaaa 2999 aaaaaaaaa3008 7 949 PRT Heterodera glycines 7 Lys Ile Thr Ile Asn Tyr Lys Thr ArgIle Phe Asn Ile Gln Ser Ser 1 5 10 15 Asp Thr Ser Thr Ile Val Asn AlaIle Arg Glu Arg Ala Arg Ile Val 20 25 30 Leu Leu Cys Phe Asp Asp Leu LysGln Met Arg Thr Phe Ala Leu Gln 35 40 45 Leu Phe Asp Gly Gly Leu Asn ThrLys Asp Tyr Val Tyr Ile Met Val 50 55 60 Asp Asn Asp Met Tyr Leu Ser PheAsn Leu Thr Arg Leu Pro Phe Trp 65 70 75 80 Val Gln Ser Ser Asn Asn SerAsn Thr Leu Asp Gly Arg Asn Ala Asp 85 90 95 Ala Glu Val Ile Gly Arg LeuAla Leu Trp Trp His Tyr Asp Ile Thr 100 105 110 Leu Ser Ala Leu Ser AsnGln Asn Tyr Tyr Gly Phe Phe Lys Arg Val 115 120 125 Ile Asp Arg Thr GlyAsp Trp Pro Phe Tyr Cys Asp Glu Ser Asn Cys 130 135 140 Ser Lys Val IleAsn Ala Ser Ile Tyr Ser Leu Leu Leu Tyr Asp Ala 145 150 155 160 Ile TyrAsn Tyr Gly Met Ala Leu Asn Glu Ser Phe Arg Gln Phe Gly 165 170 175 IleArg Pro Glu Val Tyr Arg Asn Gly Thr Leu Leu Ala Arg Asn Asn 180 185 190Arg Lys Pro Phe Met Gly Leu Thr Gly Tyr Val Thr Val Glu Thr Asp 195 200205 Gln Asn Thr Arg Val Phe Val Leu Ser Asn Arg Lys Ser Ser Glu Lys 210215 220 Gly Asn Ala Leu Arg Ile Leu Met Gln Phe Ala Trp Val Glu Gly Lys225 230 235 240 Leu Gln Ile Ser Leu Arg Asn Gly Ser Leu Ser Met Trp SerSer Arg 245 250 255 Gly Gly Ser Ile Pro Pro Ala Val Pro Ile Cys Gly PheAsp Gly Lys 260 265 270 Gly Cys Ala Ala Ser Val Phe Glu Met Tyr Lys GlyTyr Leu Leu Leu 275 280 285 Gly Ile Ala Leu Phe Val Val Thr Ile Ser GlySer Thr Phe Thr Val 290 295 300 Gly Phe Leu Ile His Ala Lys Phe Val GluGly Arg Arg Ser Asn Met 305 310 315 320 Ser Trp Lys Ile Pro Phe Ala LeuLeu Thr Lys Ser Lys Pro Lys Arg 325 330 335 Ala Asp Arg Thr Ala Ala AsnArg Ser Arg His Ser Val Arg Ser Asn 340 345 350 Gln Thr Asn Ile Ser SerLeu Thr His Ser Thr Ile Gly Ser Leu Ala 355 360 365 Arg Ser Arg Ile PheSer Leu Tyr Ser Tyr Asn Gly Glu Lys Cys Ile 370 375 380 Val Arg Ser PheGly Ser Thr Thr Met Ala Lys Ala Phe Thr Val Thr 385 390 395 400 Gln MetAla Glu Cys Arg Thr Met Arg Leu Phe Asp His Glu Asn Val 405 410 415 AsnArg Phe Leu Gly Leu Ser Leu Asp Gly Ala Asn Val Leu Ala Val 420 425 430Trp Asn Phe Cys Met Arg Gly Ser Ile Arg Asp Val Ile Leu Ser Glu 435 440445 Asn Ala Met Val Lys Asp Val Ile Phe Ile Gln Ser Ala Ile Lys Glu 450455 460 Ile Cys Glu Gly Ile His Phe Leu His Asn Ser Pro Leu Gln Phe His465 470 475 480 Gly Arg Leu Lys Ser Ser Ala Cys Leu Ile Asn Asp Arg TrpGln Val 485 490 495 Lys Ile Ser Tyr Phe Gly Leu Arg Trp Leu Lys Ser SerGln Lys Asn 500 505 510 Arg Ala Lys Asp Leu Leu Trp Leu Ser Pro Glu GlnLeu Arg Lys Met 515 520 525 Gly Asp Ser Glu Ile Val Glu Gly Ser Lys HisSer Asp Ile Tyr Thr 530 535 540 Met Ala Leu Ile Phe Thr Glu Met Val AsnMet Ser Pro Cys Trp Asp 545 550 555 560 Ser Ser Glu Ala Asp Gly Ala GluAla Asp Arg Ala Glu Asp Gly Glu 565 570 575 Glu Gln Asn Gly Thr Glu MetSer Arg Arg Lys Gln Thr Ala Glu Thr 580 585 590 Glu Gly Glu Thr Ala GlnArg Arg Pro Gly Arg Arg Ala Arg Gly Arg 595 600 605 Asn Ala Glu Glu IleAla Tyr Leu Val Lys Arg Gly Gly Ile Val Pro 610 615 620 Leu Arg Pro IleIle Arg Pro Ala Phe Asp His Leu Asn Thr Glu Val 625 630 635 640 Ile HisLeu Ile Arg Asp Cys Trp Val Glu Thr Pro Ser Glu Arg Pro 645 650 655 ThrIle Glu Lys Val Arg Gln Lys Leu Arg Gln Met Gly Ala Gln Arg 660 665 670Arg Val Asn Leu Met Asp His Val Phe Asp Met Leu Glu Gln Tyr Ala 675 680685 Asn Lys Leu Glu Glu Glu Val Gln Glu Arg Thr Lys Glu Leu Glu Gly 690695 700 Glu Lys Arg Lys Ser Asp Ile Leu Leu Tyr Arg Met Met Pro Arg Gln705 710 715 720 Val Ala Asp Arg Leu Lys Leu Gly Gln Ser Val Glu Pro GluGln Phe 725 730 735 Asp Cys Val Thr Val Phe Phe Ser Asp Ile Val Gln PheAla Ala Leu 740 745 750 Ser Asn Gln Met Arg Pro Leu Gln Val Val Asn LeuMet Asn Glu Leu 755 760 765 Tyr Thr Ile Phe Asp Ala Ile Ile Asp Glu HisAsp Val Tyr Lys Gly 770 775 780 Asp Gly Tyr Leu Cys Val Ser Gly Leu ProAsn Arg Asn Gly Thr Leu 785 790 795 800 His Ala Lys His Cys Ala Asp MetAla Ile Lys Phe Met Gln Ala Leu 805 810 815 Leu Asn Phe Arg Ile Pro AspLeu Pro Asn Glu Arg Val Arg Leu Arg 820 825 830 Ile Gly Leu His Ser GlyPro Cys Val Ala Gly Val Val Gly Leu Ala 835 840 845 Met Pro Arg Tyr CysLeu Phe Gly Asp Thr Val Asn Thr Ala Ser Arg 850 855 860 Met Glu Ser SerSer Ser Pro Asn Lys Ile His Met Ser Ser Glu Thr 865 870 875 880 Leu GluLeu Leu His Lys Asn Phe Asn Gly Ser Tyr His Thr Glu Ser 885 890 895 ArgGly Glu Val Ile Ile Lys Gly Lys Gly Val Met Glu Thr Phe Trp 900 905 910Leu Leu Gly Gln Val Glu Asn Gly Thr Thr Ile Asn Ala Asp Tyr Ala 915 920925 His Arg Met His Leu Pro Val Ile Lys Phe Gly Glu Glu Gly Asn Glu 930935 940 Thr Gly Lys Asn Ala 945 8 30 DNA Artificial sequenceOligonucleotide probe 8 agcggatccc gtccgcgcat ggacattgtg 30 9 30 DNAArtificial sequence Oligonucleotide probe 9 ccgctcgagc gttgcggcactcgcatttct 30

that which is claimed is:
 1. An isolated DNA encoding a nematode guanylyl cyclase chemoreceptor selected from the group consisting of: (a) isolated DNA having a nucleotide sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NO: 6; (b) isolated DNA that hybridizes to DNA of (a) above under stringent conditions and encodes a nematode guanylyl cyclase chemoreceptor; and (c) isolated DNA that differs from the DNA of (a) or (b) above due to the degeneracy of the genetic code, and encodes a nematode guanylyl cyclase chemoreceptor encoded by (a) or (b) above.
 2. An isolated DNA according to claim 1 selected from the group consisting of: (a) isolated DNA having a nucleotide sequence selected from the group consisting of SEQ ID NO: 4 and SEQ ID NO: 6; (b) isolated DNA that hybridizes to DNA of (a) above under stringent conditions in which said isolated DNA does not hybridize to DNA having a nucleotide sequence of SEQ ID NO: 1, and encodes a nematode guanylyl cyclase chemoreceptor; and (c) isolated DNA that differs from the DNA of (a) or (b) above due to the degeneracy of the genetic code, and encodes a nematode guanylyl cyclase chemoreceptor encoded by (a) or (b) above.
 3. An isolated DNA according to claim 1, which nematode guanylyl cyclase chemoreceptor is selected from the group consisting of order Tylenchida and order Aphelenchida chemoreceptors.
 4. An isolated DNA according to claim 1, which nematode guanylyl cyclase chemoreceptor is selected from the group consisting of cyst nematode, root knot nematode, lesion nematode, and reniform nematode chemoreceptors.
 5. An isolated DNA according to claim 1 having a nucleotide sequence according to SEQ ID NO:
 1. 6. An isolated DNA according to claim 1, having a nucleotide sequence according to SEQ ID NO:
 3. 7. An isolated DNA according to claim 1 having a nucleotide sequence according to SEQ ID NO:
 4. 8. An isolated DNA according to claim 1 having a nucleotide sequence according to SEQ ID NO:
 6. 9. An oligonucleotide that specifically binds to an isolated DNA of claim
 1. 10. An oligonucleotide according to claim 8, which oligonucleotide comprises DNA or RNA.
 11. An antisense oligonucleotide that specifically binds to an mRNA transcript of a DNA according to claim
 1. 12. A DNA that encodes an antisense oligonucleotide according to claim
 11. 13. A double-stranded RNA that is complementary to a DNA according to claim 1 and interferes with the expression thereof in a cell that expresses the encoded protein.
 14. An expression cassette comprising a DNA according to claim 1 and a heterologous promoter operatively associated therewith.
 15. A cell that contains an expression cassette of claim 14 and expresses said nematode guanylyl cyclase chemoreceptor.
 16. A cell according to claim 15, which cell is a yeast cell.
 17. A cell according to claim 15, which cell is a plant cell.
 18. A cell according to claim 15, which cell is an insect cell.
 19. An isolated nematode guanylyl cyclase chemoreceptor protein encoded by a DNA of claim
 1. 20. A protein or peptide that specifically binds to a protein of claim
 19. 21. A protein or peptide according to claim 20, wherein said protein or peptide is an antibody.
 22. A protein or peptide according to claim 21, wherein said antibody is a monoclonal antibody.
 23. A method of screening a compound for the ability to disrupt plant parasitic nematode feeding or chemotaxis, said method comprising: determining whether or not said compound selectively binds to a nematode guanylyl cyclase chemoreceptor protein encoded by a DNA of claim 1; the presence of such binding indicating said compound is useful in disrupting plant parasitic nematode feeding or chemotaxis.
 24. A method according to claim 23, wherein said determining step is carried out in vitro.
 25. A method according to claim 23, wherein said determining step is carried out in vivo in a cell culture comprising cells that expresses said guanylyl cyclase chemoreceptor protein.
 26. A method according to claim 26, wherein said compound is a member of a combinatorial library. 